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ENZYMATIC DNA MOLECULES 

TECHNICAL FIELD 

The present Invention relates to nucleic acid enzymes or catalytic (enzymatic) 
5 DNA molecules that are capable of cleaving other nucleic acid molecules, particularly 

RNA. The present invention also relates to compositions containing the disclosed 
enzymatic DNA molecules and to methods of making and using such enzymes and 
compositions. 

PACKGRQUNP 

10 The need for catalysts that operate outside of their native context or which 

catalyze reactions that are not represented in nature has resulted in the development of 
"enzyme engineering" technology. The usual route taken in enzyme engineering has 
been a "rational design" approach, relying upon the understanding of natural enzymes to 
aid in the construction of new enzymes. Unfortunately, the state of proficiency in the 

1 5 areas of protein structure and chemistry is insufficient to make the generation of novel 

biological catalysts routine. 

Recently, a different approach for developing novel catalysts has been applied. 
This method invoh/es the construction of a heterogeneous pool of macromolecules and 
the application of an in vitro selection procedure to isolate molecules from the pool that 

20 catalyze the desired reaction. Selecting catalysts from a pool of macromolecules Is not 

dependent on a comprehensive understanding of their structural and chemical 
properties. Accordingly, this process has been dubbed "Irrational design" (Brenner and 
Lerner, PMAS USA 89 : 5381-5383 (1992)). 

Most efforts to date involving the rational design of enzymatic RNA molecules or 

25 ribozymes have not led to molecules with fundamentally new or improved catalytic 

function. However, the application of irrational design methods via a process we have 
described as "directed molecular evolution" or "in vitro evolution", which is patterned 
after Darwinian evolution of organisms in nature, has the potential to lead to the 
production of DNA molecules that have desirable functional characteristics. 

30 This technique has been applied with varying degrees of success to RNA 

molecules in solution (see, e.g.. Mills, et al., PNAS USA 59: 217 (1967); Green, et al.. 
Nature 347 : 406 11990); Chowrlra, et al., NatnrR 354: 320 (1991); Joyce, ggoe 82 : 83 

(1989) ; Beaudry and Joyce, Science 257 ! 635-641 (1992); Ftobertson and Joyce. 
Nature 344 ! 467 (1990)), as well as to RNAs bound to a ligand that Is attached to a 

35 solid support (Tuerk, et al.. Science 249 : 505 11990); Ellington, et al., Nflture 346 : 818 

(1990) ). It has also been applied to peptides attached directly to a solid support (Lam, 
et al.. Nature 354 : 82 (1991)1; and to peptide epitopes expressed within a viral coat 
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protein (Scott, et al., gclcncg 249: 386 (1990J; Devlin, et ak. Science 34a ! 249 (1990); 
Cwiria, et al., PNAS USA 87 : 6378 (1990)). 

It has been more than a decade since the discovery of catalytic RNA (Kruger, et 
al., Cell 31: 147-157 (1982); Gucrrier-Takada, et al.. Cell 35 : 849 857 (1983)). The list 
of known naturally-occurring ribozymes continues to grow (see Cech, in The RNA World 
Gesteland & Atkins (eds.), pp. 239-269, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1993); Pyle, Scienca^fil; 709-714 (1993); Symons, Curr. Qpin. 
Siructi BIpIi 4: 322-330 (1994)) and, in recent years, has been augmented by synthetic 
ribozymes obtained through in vitro evolution. (See, e.g., Joyce, Curr. Onin. struct. 
fiioLJ:: 331-336 (1994); Breaker & Joyce, Trends Bioterh. ^0 l 268-275 (1994); 
Chapman & Szostak, Curr. Qpin. Struct. RinI A- 618-622 (1994).) 

It seems reasonable to assume that DNA can have catalytic activity as well, 
considering that it contains most of the same functional groups as RNA. However, with 
the exception of certain viral genomes and replication intermediates, nearly all of the 
DNA in biological organisms occurs as a complete duplex, precluding it from adopting a 
complex secondary and tertiary structure. Thus it is not surprising that DNA enzymes 
have not been found in nature. 

Until the advent of the present invention, the design, synthesis and use of 
catalytic DNA molecules with nucleotlde-cleaving capabilities has not been disclosed or 
demonstrated. Therefore, the discoveries and inventions disclosed herein are 
particularly significant, in that they highlight the potential of in vitro evolution as a 
means of designing increasingly more efficient catalytic molecules, including enzymatic 
DNA molecules that cleave other nucleic acids, particularly RNA. 

BRIEF SUMMARY OP THg INVgMTIQM 
The present invention thus contemplates a synthetic or engineered (I.e., non- 
naturally-occurring) catalytic DNA molecule (or enzymatic DNA molecule) capable of 
cleaving a substrate nucleic acid (NA) sequence at a defined cleavage site. The 
invention also contemplates an enzymatic DNA molecule having an endonuclease 
activity. 

In one preferred variation, the endonuclease activity is specific for a nucleotide 
sequence defining a cleavage site comprising single-stranded nucleic add in a substrate 
nucleic acid sequence. In another preferred variation, the cleavage site is double- 
stranded nucleic acid. Similarly, substrate nucleic acid sequences may be single- 
stranded, double-stranded, partially single- or double-stranded, looped, or any 
combination thereof. 

In another contemplated embodiment, the substrate nucleic acid sequence 
includes one or more nucleotide analogues. In one variation, the substrate nucleic acid 
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sequence is a portion of, or attached to, a larger molecule. 

In various embodiments, the larger molecule is selected from the group 
consisting of RNA, modified RNA, DNA, modified DNA, nucleotide analogs, or 
composites thereof. In another example, the larger molecule comprises a composite of a 
5 nucleic acid sequence and a non-nucleic acid sequence. 

In another embodiment, the invention contemplates that a substrate nucleic acid 
sequence Includes one or more nucleotide analogs. A further variation contemplates 
that the single stranded nucleic acid comprises RNA, DNA, modified RNA. modified 
DNA, one or more nucleotide analogs, or any composite thereof. In one embodiment of 

1 0 the disclosed Invention, the endonuclease activity comprises hydrolytic cleavage of a 

phosphoester bond ai the cleavage site. 

In various preferred embodiments, the catalytic DNA molecules of the present 
invention are single-stranded In whole or in part. These catalytic DNA molecules may 
preferably assume a variety of shapes consistent with their catalytic activity. Thus, in 

1 5 one variation, a catalytic DNA molecule of the present invention includes one or more 

hairpin loop structures. In yet another variation, a catalytic DNA molecule may assume 
a shape similar to that of "hammerhead" ribozymcs. In still other embodiments, a 
catalytic DNA molecule may assume a conformation similar to that of Tetrahymena 
thermophffa ribozymes, e.g., those derived from group I introns, 

20 Similarly, preferred catalytic DNA molecules of the present Invention are able to 

demonstrate site-specific endonuclease activity irrespective of the original orientation of 
the substrate molecule. Thus, in one preferred embodiment, an enzymatic DNA 
molecule of the present invention is able to cleave a substrate nucleic acid sequence 
that is separate from the enzymatic DNA molecule - i.e., it is not linked to the 

25 DNAzyme. In another preferred embodiment, an enzymatic DNA molecule Is able to 

cleave an attached subsuate nucleic acW sequence - i.c„ It is able to perform a reaction 
similar to setf>cleavage. 

The invention also contemplates enzymatic DNA molecules (catalytic DNA 
molecules, deoxyrlbozymes or DNAzymesI having endonuclease activity, whereby the 

30 endonuclease activity requires the presence of a divalent cation. In various preferred, 

alternative embodiments, the divalent cation is selected from the group consisting of 
Pb'*, Mg**, Mn^*, Zn^*, and Ca'*. Another variation contemplates that the 
endonuclease activity requires the presence of a monovalent cation. In such alternative 
embodiments, the monovalent cation is preferably selected from the group consisting of 

35 Na* and K*. 

In various preferred embodiments of the Invention, an enzymatic DNA molecule 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NO 3, 
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SEQ ID NO 14; SEQ ID NO 15; SEQ ID NO 16; SEQ ID NO 17; SEQ ID NO 18; SEQ ID 
NO 19; SEQ ID NO 20; SEQ ID NO 21; and SEQ ID NO 22. In other preferred 
embodiments, a catalytic DNA molecule of the present invention comprises a nucleotide 
sequence selected from the group consisting o1 SEO ID NO 23; SEQ ID NO 24; SEQ ID 
NO 25; SEQ ID NO 26; SEQ ID NO 27; SEQ ID NO 28; SEQ ID NO 29; SEQ ID NO 30; 
SEQ ID NO 31 ; SEQ ID NO 32; SEQ ID NO 33; SEQ ID NO 34; SEQ ID NO 35; SEQ ID 
NO 36; SEQ ID NO 37; SEQ ID NO 38; and SEQ ID NO 39. 

Another preferred embodiment contemplates that a catalytic DNA molecule of 
the present invention comprises a nucleotide sequence selected from the group 
consisting of SEQ ID NO 50 and SEQ ID NO 51 . In yet another preferred embodiment, a 
catalytic DNA molecule of the present invention comprises a nucleotide sequence 
selected from the group consisting of SEQ ID NOS 52 through 101 . As disclosed 
herein, catalytic DNA molecules havmg sequences substantially similar to those 
disclosed herein are also contemplated. Thus, a wide variety of substitutions, deletions, 
insertions, duplications and other mutations may be made to the within-described 
molecules in order to generate a variety of other useful enzymatic DNA molecules; as 
long as said molecules display site-specific cleavage activity as disclosed herein, they 
are within the boundaries of this disclosure. 

In a further variation of the present invention, an enzymatic DNA molecule of the 
present invention preferably has a substrate binding affinity of about 1 ^ or less. In 
another embodiment, an enzymatic DNA molecule of the present invention binds 
substrate with a Kq of less than about 0.1 ^vM. 

The present invention also discloses enzymatic DNA molecules having useful 
turnover rates. In one embodiment, the turnover rate is less than 5 hr'; in a preferred 
embodiment, the rate is less than about 2 hr^ in a more preferred embodiment the rate 
is (ess than about 1hr\- in an even more preferred embodiment, the turnover rate is 
about 0.6 hr* or less. 

In still another embodiment, an enzymatic DNA molecule of the present 
invention displays a useful turnover rate wherein the Is less than 1 min*\ preferably 
less than 0.1 min'\ more preferably, less than 0.01 min'*; and even more preferably, 
less than 0.005 m\r\'\ In one variation, the value of is approximately 0.002 min'^ or 
less. 

The present invention also contemplates embodiments in which the catalytic rate 
of the disclosed DNA enzymes is fully optimized. Thus, in various preferred 
embodiments, the K„ for reactions enhanced by the presence of Mg'^ is approximately 
0.5-20 mM, preferably about 1-10 mM, and more preferably about 2-5 mM, 

The present invention also contemplates an embodiment whereby the nucleotide 
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sequence defining the cleavage site comprises at least one nucleotide. In various other 
preferred embodiments, a catalytic DNA molecule of the present invention is able to 
recognize and cleave a nucleotide sequence defining a cleavage site of two or more 
nucleotides. 

5 In various preferred embodiments, an enzymatic DNA molecule of the present 

Invention comprises a conserved core flanked by one or more substrate binding regions. 
In one embodiment, an enzymatic DNA molecule Includes first and second substrate 
binding regions* In another embodiment, an enzymatic DNA molecule Includes two or 
more substrate binding regions. 

10 As noted previously, preferred catelytic DNA molecules of the present invention 

may also include a conserved core. In one preferred embodiment, the conserved core 
comprises one or mora conserved regions. In other preferred variationSr the one or more 
conserved regions include a nucleotide sequence selected from the group consisting of 
CG; CGA; AGCG; AGCCG; CAGCGAT; CTTGTTT; and CTTATTT (see, e.g.. Fig. 3). 

15 In one embodiment of the invention, an enzymatic DNA molecule of the present 

Invention further comprises one or more variable or spacer nucleotides between the 
conserved regions in the conserved core. In another embodiment, an enzymatic DNA 
molecule of the present invention further comprises one or more variable or spacer 
nucleotides between the conserved core and the substrate binding region. 

20 In one variation, the first substrate binding region preferably includes a 

nucleotide sequence selected from the group consisting of CATCTCT; GCTCT; 
TTGCTTTTT; TGTCTTCTC; TTGCT6CT; GCCATGCTTT (SEC ID NO 40); CTCTATTTCT 
(SEQ ID NO 41); GTCGGCA; CATCTCTTC; and ACTTCT. In another preferred variation, 
the second substrate binding region includes a nucleotide sequence selected from the 

25 group consisting of TATGTGACGCTA (SEO ID NO 42); TATAGTCGTA (SEQ ID NO 43); 

ATAGCGTATTA (SEQ ID NO 44); ATAGTTACGTCAT JSEQ ID NO 46); 
AATAGTGAAGTGTT (SEQ ID NO 46); TATAGTGTA; ATAGTCGGT; ATAGGCCCGGT 
{SEQ ID NO 47); AATAGTGAGGCTTG ISEO ID NO 48); and ATGNTG. 

In various embodiments of the present Invention, the substrate binding regions 

30 vary In length. Thus, for example, a substrate binding region may comprise a single 

nucleotide to dozens of nucleotides. However, it is understood that substrate binding 
regions of about 3-25 nucleotides in length, preferably about 3-15 nucleotides in length, 
and more preferably about 3-10 nucleotides in length are particularly preferred. In 
various embodiments, the individual nucleotides In the substrate binding regions are able 

35 to f orni complementary base pairs with the nucleotides of the substrate molacuies; In 
other embodiments, noncomplementery base pairs are formed. A mixture of 
complementary and noncomplementery base pairing is also contemplated as falling 
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wtthin the scope of the disclosed embodiments of the invention. 

In another preferred embodiment, a catalytic DNA molocuie of the present 
invention may further comprise a third substrate binding region. In some preferred 
embodiments, the third region includes a nucleotide sequence selected from the group 
consisting of TGTT; TGTTA; and T6TTA6. Another preferred embodiment of the 
present invention discloses an enzymatic DNA rrMlecule further comprising one or more 
variable or "spacer** regions between the substrate binding regions. 

In another disclosed embodimant, the present invention contemplates a purified, 
synthetic enzymatic DNA molecule separated from other DNA molecules and 
oligonucleotides, the enzymatic DNA molecule having an endonudease activity, wherein 
the endonudease activity is specific for a nucleotide sequence defining a cleavage site 
comprising single- or double-stranded nucleic acid in a substrate nucleic acid sequence. 
In one variation, a synthetic (or engineered) enzymatic DNA molecule having an 
endonudease activity is disclosed, wherein the endonudease acthfity is specific for a 
nucleotide sequence defining a cleavage site consisting essentially of a single- or double- 
stranded region of a substrate nucleic acid sequence. 

In yet another embodiment, the invention contemplates an enzymatic DNA 
molecule comprising a deoxyribonudeotlde polymer having a catalytic activity for 
hydrotyzing a nudeic acid-containing substrate to produce substrate cleavage products. 
In one variation, the hydrolysis takes place In a site-specific manner. As noted 
previously, the polymer may be single-stranded, double-stranded, or some combination 
of both. 

The invention further contemplates that the substrate comprises a nucleic acid 
sequence. In various embodiments, the nudeic acid sequence substrate comprises 
RNA, modified RNA, DNA, modified DNA, one or more nucleotide analogs, or 
composites of any of the foregoirig. One embodiment contemplates that the substrate 
includes a single-stranded segment; stitl another embodiment contemplates that the 
substrate is double-stranded. 

The present Invention also contemplates an enzymatic DNA molecule comprising 
a deoxyribonudeotide polymer having a catalytic activity for hydrolyzing a nucleic acid- 
containing substrate to produce a cleavage product. In one variation, the enzymatic 
DNA molecule has an effective binding affinity for the substrate and lacks an effective 
binding affinity for the cleavage product. 

In one preferred embodiment, the invention discloses a non-naturaliy-occurring 
enzymatic DNA molecule comprising a nucleotide sequence defining a conserved core 
flanked by recognition domains, variable regions, and spacer regions. Thus, in one 
preferred embodiment, the nucleotide sequence defines a first variable region contiguous 
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or adjacent to the 5'-terminus of the molecule, a first recognition domain located 3'- 
terminal to the first variable region, a first spacer region located 3*-terminal to the first 
recognition domain, a first conserved region located 3'-terminal to the first spacer 
region, a second spacer region located 3'-termlnal to the first conserved region, a 
second conserved region located 3'-termlnal to the second spacer region, a second 
recognition domain located 3*-termlnal to the second conserved region, and a second 
variable region located 3'-terminal to the second recognition domain. 

In another embodiment, the nucleotide sequence preferably defines a first 
variable region contiguous or adjacent to the 5*-terminus of the molecule, a first 
recognition domain located 3'-termlnaI to the first variable region, a first spacer region 
located 3^terminal to the first recognition domain, a first conserved region located 3*- 
terminal to the first spacer region, a second spacer region located 3'-termlnal to the first 
conserved region, a second conserved region located 3'-terminal to the second spacer 
region, a second recognition domain located 3 -tcrminal to the second conserved region, 
a second variable region located 3'-termlnal to the second recognition domain, and a 
third recognition domain located 3*-terminal to the second variable region. 

In one variation of the foregoing, the molecule includes a conserved core region 
flanked by two substrate binding domains; in another, the conserved core region 
comprises one or more conserved domains. In other preferred embodiments, the 
conserved core region further comprises one or more variable or spacer nucleotides. In 
yet another embodiment, an enzymatic ONA molecule of the present invention further 
comprises one or more spacer regions. 

The present invention further contemplates a wide variety of compositions. For 
exsmple. compositions including an enrymatic DNA molecule as described hereinabove 
are disclosed and contemplated herein. In one alternative embodiment, a composition 
according to the present invention comprises two or more populations of enzymatic 
DNA molecules as described above, wherein each population of enzymatic DNA 
molecules is capable of deaving a different sequence in a substrate. In another 
variation, a composition comprises two or more populations of enzymatic ONA 
molecules as described hereinabove, wherein each population of enzymatic DNA 
molecules Is capable of recognizing a different substrate. In various embodiments, it is 
also preferred that compositions Include a monovalent or divalent cation. 

The present invention further contemplates methods of generating, selecting, 
and isoJating enzymatic DNA molecules of the present Invention. In one variation, a 
method of selecting enzymatic DNA molecules that cleave a nucleic acid sequence (e.g.. 
RNA) at a specific site comprises the following steps: (a) obtaining a population of 
putative enzymatic ONA molecules - whether the sequences are naturally-occurring or 
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synthetic — and preferably, they are single-stranded ONA molecules; (b) admixing 
nucleotide*contalning substrate sequences with the aforementioned population of ONA 
molecules to form an admixture; (c) maintaining the admixture for a sufficient period of 
time and under predetermined reaction conditions to allow the putative enzymatic DNA 
5 molecules in the population to cause cleavage of the substrate sequences, thereby 

producing substrate cleavage products; (d) separating the population of DNA molecules 
from the substrate sequences and substrate cleavage products; and (e) isolating DNA 
molecules that cleave substrate nucleic acid sequences (e.g», RNA) at a specific site 
from the population. 

10 In a further variation of the foregoing method, the ONA molecules that cleave 

substrate nucleic acid sequences at a specific site are tagged with an immobilizing 
agent. In one example, the agent comprises blotin. 

In yet another variation of the aforementioned method, one begins by selecting a 
sequence - e.g., a predetermined "target" nucleotide sequence - that one wishes to 

1 5 cleave using an enzymatic DNA molecule engineered for that purpose. Thus, in one 

embodiment, the pre-selected (or predetermined) 'target" sequence Is used to generate 
a population of DNA molecules capable of cleaving substrate nucleic acid sequences at 
a specific site via attaching or ''tagging* it to a deoxyribonucleic acid sequence 
containing one or more randomized sequences or segments. In one variation, the 

20 randomized sequence Is about 40 nucleotides in length; in another variation, the 

randomized sequence is about 50 nucleotides in length. Randomized sequences that are 
1-40, 40-50, and 50-100 nucleotides in length are also contemplated by the present 
Invention. 

In one embodiment of the present invention, the nucleotide sequence used to 
25 generate a population of enzymatic DNA molecules is selected from the group consisting 

of SEQ ID NO 4, 23, 50 AND 51 . In another embodiment, the "target" or "substrate" 
nucleotide sequence comprises a sequence of one or more ribonucleotides see* e.g., 
the relevant portions of SEQ ID NOS 4 and 23, and SEQ ID NO 49. It is also 
contemplated by the present invention that a useful "target" or 'substrate' nucleotide 
30 sequence may comprise ONA, RNA, or a composite thereof. 

The invention also contemplates methods as described above, wherein the 
isolating step further comprises exposing the tagged DNA molecules to a solid surface 
having avidin linlced thereto, whereby the tagged ONA molecules become attached to 
the solid surface. As before, the substrate may be RNA, DNA, a composite of both, or 
35 a molecule including nucleotide sequences. 

The present invention also contemplates a method for specificelly cleeving a 
substrate nucleic acid sequence at a particular cleavage site, comprising the steps of (a) 
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providing an enzymatic DNA molecule capable of cleaving a substrate nucleic acid 
sequence at a specific cleavage site; and (b) contacting the enrymatic DNA molecule 
with the substrate nucleic acid sequence to cause specific cleavage of the nucleic acid 
sequence at the cleavage site. In one variation, the enzymatic DNA molecule is a non- 
5 naturally-occurring (or synthetic) DNA molecule. In another variation, the enzymatic 

DNA molecule is single-stranded. 

In still another variation of the foregoing method, the substrate comprises a 
nucleic acid. In various embodiments, the substrate nucleic acid comprises RNA, 
modified RNA, DNA. modified DNA, one or more nucleotide analogs, or composites of 

10 any of the foregoing. In yet another embodiment, the specific cleavage is caused by the 

endonuclease activity of the enzymatic DNA molecule. Alteration of reaction conditions 
" e.g., the adjustment of pH, temperature, percent cation, percent enzyme, percent 
substrate, and percent product ~ is also contemplated herein. 

The present invention also contemplates a method of cleaving a phosphoester 

15 bond, comprising (a) admixing an catalytic DNA molecule capable o1 cleaving a 

substrate nucleic acid sequence at a defined cleavage site with a phosphoester bond- 
containing substrate, to form a reaction admixture; and (b) maintaining the admixture 
under predetermined reaction conditions to allow the enzymatic DNA molecule to cleave 
the phosphoester bond, thereby producing a population of substrate products. In one 

20 embodiment, the enzymatic DNA molecule is able to cleave the phosphoester bond in a 

site-specific manner. In another embodiment, the method further comprises the steps of 
(c) separating the products from the catalytic DNA molecule; and (d) adding additional 
substrate to the enzymatic DNA molecule to form e new reaction admixture. 

The present invention also contemplates methods of engineering enzymatic DNA 

25 molecules that cleave phosphoester bonds. One exemplary method comprises the 

following steps: (a) obtaining a population of single-stranded DNA molecules; (b) 
introducing genetic variation into the population to produce a variant population; Ic) 
selecting individuals from the variant population that meet predetermined selection 
criteria; <d) separating the selected individuals from the remainder of the varient 

30 population; and (e) amplifying the selected Individuals. 

pRiPP nPSCRIPTION OF THF DRAWINSS 
Figure 1 illustrates a selective amplification scheme for Isolation of DNAs that 
cleave a target RNA phosphoester. As shown, double-stranded DNA that contains a 
stretch of 50 random nucleotides (the molecule with "N^o" indicated above iti is 

35 amplified by PCR, employing a 5 '-biotinylated DNA primer that is terminated at the 3 ' 

end by an adenosine ribonucleotide (rA). (The biotin label is indicated via the encircled 
letter "BM This primer Is extended by Tag polymerase to yield a DNA product that 
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contains a single embedded ribonucleotide. The resulting double-stranded DNA Is 
Immobilized on a streptavidin matrix and the unbiotinylated DNA strand is removed by 
washing with 0,2 N NaOH. After re-equillbrating the column with a buffered solution, 
the column Is washed with the same solution with added 1 mM PbOAc. DNAs that 
undergo Pb'^-dependent self-cleavage are released from the column, collected in the 
eluant, and amplified by PCR. The PCR products are then used to initiate the next round 
of selective amplification. 

Figure 2 illustrates self-cleavage activity of the starting pool of DNA (GO) and 
populations obtained after the first through fifth rounds of selection (G1 - G5), In the 
presence of lead cation (Pb'*). The symbol Pre represents 108*nucleotlde precursor 
DNA (SEQ ID NO 4); Civ, 28-nucleotide 5 ' cleavage product (SEQ ID NO 5); and M, 
primer 3a (SEQ ID NO 6), which corresponds in length to the 5 ' cleavage product. 

Figure 3 illustrates the sequence alignment of individual variants isolated from 
the population after five rounds of selection. The fixed substrate domain is shown at 
the top, with the target riboadenylate identified via an Inverted triangle. Substrate 
nucleotides that are commonly invohred in presumed base-pairing interactions are 
indicated by vertical bars. Sequences corresponding to the 50 inrtially-randomized 
nucleotides are aligned antiparallel to the substrate domain. All of the variants are 
3 '-terminated by the fixed sequence 5 '-CGGTAAGCTTGGCAC-3 ' (not shown; SEQ ID 
NO 1|. Nucleotides within the initially-randomized region that are presumed to form 
base pairs with the substrate domain are indicated on the right end left sides of the 
Figure; the putative base pair-forming regions of the enzymatic DNA molecules are 
individually boxed in each sequence shown. Conserved regions are illustrated via the 
two large, centrally-located boxes. 

Figures 4A and 46 illustrate DNA>catalyzed cleavage of an RNA phosphoester in 
an intermolecular reaction that proceeds with catalytic turnover. Rgure 4A is a 
diagrammatic representation of the complex formed between the 19mer substrate (3'- 
TCACTATrAGGAAGAGATGG-5', SEQ ID NO 2) and SSmer DNA enzyme I5'- 
ACACATCTCTGAAGTAGCGCCGCCGTATAGTGACGCTA-3', SEQ ID NO 3). The 
substrate contains a single adenosine ribonucleotide TrA*, adjacent to the arrow), 
flanked by deoxyribonucleotides. The synthetic DNA enzyme is a 38-nucleotlde portion 
of the most frequently occurring variant shown in Fig. 3. Highly-conserved nucleotides 
located within the putative catalytic domain are "boxed". As illustrated, one conserved 
sequence is "AGOG", while another is "CG" (reading in the 6*^3* direction). 

Figure 4B shows an Eedie-Hofstee plot used to determine K„ (negative slope) 
and V„ (y-intercept) for DNA-catalyzed cleavage of 15 '-^^PMabeled substrate under 
conditions identical to those employed during in vitro selection. Initial rates of cleavage 
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were determined for reactions involving 5 nM DNA eiayme and either 0.125, 0.5, 1, 2. 

or 4 AtM substrate. 

Figure 5 is a photographic representation showing a polyacrylamide gol 
demonstrating specific endoribonuclease activity of four families of selected catalytic 
5 DNAs. Selection of a Pb^*-dependent family of molecules was repeated In a slda-by- 

side fashion as a control Ifirst group). In the second group, Zn»* Is used as the cation; 
in group three, the cation is Mn»*; and in the fourth group, the cation is Mg**. A fifth 
site on the gel consists of the cleavage product alone, as a marker. 

As noted, there are three lanes within each of the aforementioned four groups. 
10 In each group of three lanes, the first lane shows the lacit of activity of the selected 
population in the absence of the metal cation, the second lane shows the observed 
activity in the presence of the metal cation, and the third lane shows the lack of activity 
of the starting pool (GO). 

Figures 6A and 6B provide two-dimensional illustrations of a •progenitor* 
1 5 catalytic DNA molecule and one of severd catalytic DNA molecules obtained via the 
selective amplification methods disclosed herein, respectively. Figure eA illustrates an 
exemplary molecule from the starting pool, showing the overall configuration of the 
molecules represented by SEQ ID NO 23. As illustrated, various complementary 
nucleotides flank the random (N,o) region. Figure BB is a diagrammatic representation of 
20 one of the Mg'* -dependent catalytic DNA molecules (or "DNAzymes") generated via the 

within-described procedures. The location of the ribonucleotide in the substrate nucleic 
acid is indicated via the arrow in both Figs. 6A and 6B. 

Figure 7 Illustrates some of the results of ten rounds of in vitro selective 
amplification carried out essentiaBy as described In Example 5 herelnbelow. As shown, 
25 two sites and two families of catahrats emerged as displaying the most efficient 

cleavege of the target sequence. Cleavage conditions were essentially as indicated in 
Fig. 7, namely, lOmM Mg'\ pH 7.5. and 37-C; data collected after the reaction ran for 
2 hours is shown. Cleavage (%) is shown plotted against the number of generations 
(here, 0 through 10). The number/prevalence of catalytic DNA molecules capable of 
30 cleaving the target sequence at the indicated sites in the substrate is IHustrated via the 
vertical bars, with cleavage at G1UAACUA6AGAU shown by the striped bars, and With 
cleavage at GUAACUAIGAGAU Illustrated via the open (lightly-shaded) bars. 

Figure 8 ttlustrates the nucleotide sequences, cicavago sites, and turnover rates 
of two catalytic DNA molecules of the present invention, clones 8-17 and 10-23. 
35 Reaction conditions were as shown, namely, lOmM Mg", pH 7.5, and 37'C. The 

DNAzyme identified as clone 8-17 is illustrated on the left, with tiie site of cleavage of 
the RNA substrate indicated by the arrow. The subsuate sequence 15' - 
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GGAAAAAGUAACUAGAGAUGGAAG - 3') - which is separate from the DNAzyme (i.e., 
intermolecular cleavage Is shown) - is labeled as such. Similarly, the DNAzyme 
Identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 
substrate indicated by the arrow. Again, the substrate sequence Is indicated. For the 8- 
17 enzyme, the turnover rate was approximately 0,6 hr'; for the 10-23 enzyme, the 
turnover rate was approximately 1 hr\ Noncomptementary pairings are indicated with a 
closed circle {•), whereas complementary pairings are Indicated with a vertical line (|). 

Figure 9 further itiustrates the nucleotide sequences, cleavage sites, and 
turnover rates of two catalytic DNA molecules of the present invention, clones 8-17 and 
10-23. Reaction conditions were as shown, namely, lOmM Mg^*, pH 7.5, and 37 X. 
As in Fig. 8, the DNAzyme Identified as clone 8-17 is illustrated on the left, with the site 
of cleavage of the RNA substrate indicated by the arrow. The substrate sequence (5' - 
GGAAAAAGUAACUAGAGAUGGAAG - 3') -which Is separate from the DNAzyme (I.e., 
intermolecular cleavage is shown) - is labeled as such. Similarly, the DNAzyme 
identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 
substrate indicated by the arrow. Again, the substrate sequence Is indicated. For the 8* 
17 enzyme, k^,, was approximately 0.002 min '; for the 10-23 enzyme, the value of k,b. 
was approximately 0,01 min \ Noncomplementary pairings are indicated with a closed 
circle (•), whereas complementary pairings are indicated with a vertical line (|). 

DFTAILED DESCRIPTION 

A. Definitions 

As used herein, the term "deoxyribozyme" is used to describe a DNA-containing 
nucleic acid that Is capable of functioning as an enzyme. In the present disclosure, the 
term "deoxyribozyme" includes endoribonucleases and endodeoxyribonucleases, 
although deoxyribozymes with endoribonuclease activity are particularly preferred. 
Other terms used Interchangeably with deoxyribozyme herein are "enzymatic DNA 
molecule", 'DNAzyme", or "catalytic DNA molecule", which terms should all be 
understood to include enzymatically active portions thereof, whether they are produced 
synthetically or derived from organisms or other sources. 

The term "enzymatic DNA molecules** also Includes DNA molecules that have 
complementarity in a substrate-binding region to a specified oligonucleotide target or 
substrate; such molecules also have an enzymatic activity which is active to specifically 
cleave the oligonucleotide substrate. Stated in anotHer fashion, the enzymatic DNA 
molecule Is capable of cleaving the oligonucleotide substrate intermolecular ly. This 
complementarity functions to allow sufficient hybridization of the enzymatic DNA 
molecule to the substrate oligonucleotide to allow the intermolecular cleavage of the 
substrate to occur. While one-hundred percent (100%) complementarity is preferred, 
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complemantarity in the range of 75-100% is also uselol and contemplated by the 
present invention. 

Enzymatic DNA molecules of the present invention may alternatively be 
described as having nuclease or ribonudease activity. These terms may be used 
interchangeably herein. 

The term "enzymatic nucleic acid" as used herein encompasses enzymatic RNA 
or DNA molecules, enzymatic RNA-DNA polymers, and enzymatically active portions or 
derivatives thereof, afthoogh enzymatic DNA molecules are a particularly preferred class 
of enzymatically active molecules according to the present invention. 

The term "endodeoxyribonuclease", as used herein, is an enzyme capable of 
cleaving a substrate comprised predominantly of DNA. The term -endoribonuelease", as 
used herein, is an enzyme capable of cleaving a substrate comprised predominantly of 
RNA. 

As used herein, the term "base pair" (bp) is generaUy used to describe a 
partnership of adenine (A) with thymine (T) or uracB (U). or of cytosine (C) with guanine 
(G), although it should be appreciated that less-common analogs of the bases A. T. C. 
and G (as well as U) may occasionally participate in base pairings. Nucleotides that 
normally pair up when DNA or RNA adopts a double stranded configuration may also be 
referred to herein as "complementary bases'. 

•Complementary nucleotide sequence" generally refers to a sequence of 
nucleotides in a single-suanded molecule or segment of DNA or RMA that is suffidently 
complementary to that on another single oligonucleotide strand to specifically hybridize 
to it with consequent hydrogen bonding. 

-Nucleotide" generally refers to a monomeric unit of DNA or RNA consisting of a 
sugar moiety (pentose), a phosphate group, and a nitrogenous heterocyclic base. The 
base Is linked to the sugar moiety via the glycosidic carbon (V carbon of the pentose) 
and that combination of base and sugar is a "nucleoside*. When the nucleoside 
contains a phosphate group bonded to the 3' or 5' position of the pentose, it is referred 
to as a nucleotide. A sequence of operatlvely linked nucleotides is typically referred to 
herein as a "base sequence" or "nucleotide sequence", and their grammatical 
equivalents, and is represented herein by a tonnula whoso left to right orientation is in 
the conventional direction of 6'-tem>inus to 3'-tenninus. unless otherwise specified. 

"Nucleotide analog" generally refers to a purine or pyrimldine nucleotide that 
differs structurally from A, T, G, C, or U, but is sufficiently similar to substitute for the 
normal nucleotide in a nucleic acid molecule. As used herein, the term "nucleotide 
analog- encompasses altered bases, different or unusual sugars (l.e. sugars other than 
the "usual- pentose), or a combination of the two. A listing of exemplary anelogs 
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wherein the base has been altered is pravided in section C hereinbelow. 

"Oligonucleotide or polynucleotide* generally refers to a polymer of single- or 
double-stranded nucleotides. As used herein, "oligonucleotide'* and its grammatical 
equivalents will include the full range of nucleic acids. An oligonucleotide wit! typically 
5 refer to a nucleic acid molecule comprised of a linear strand of ribonucleotides. The 

exact size will depend on many factors, which in turn depends on the ultimate 
conditions of use, as is well Icnown in the art. 

As used herein, the term ''physiologic conditions* Is meant to suggest reaction 
conditions emulating those found in mammalian organismis, particularly humans. While 

10 variables such as temperature, availability of cations, and pH ranges may vary as 

described in greater detail below, 'physiologic conditions" generally comprise a 
temperature of about 35-40''C, with ST'^C being particularly preferred, as well as a pH 
of about 7.0-6.0, with 7,5 being particularly preferred, and further comprise the 
availability of cations, preferably divalent and/or monovalent cations, with a 

15 concentration of about 2-15 mM Mg^^ and 0-1.0 M Na+ being particularly preferred. 
"Physiologic corKiitions", as used herein, may optionally include the presence of free 
nucleoside cofactor. As noted previously, preferred conditions are descrik>ed in greater 
detail below. 

B. Engymatic PNA Molecules 

20 In various embodiments, an enzymatic DNA molecule of the present invention 

may combine one or more modifications or mutations including additions, deletions, and 
substitutions. In alternative embodiments, such mutations or modifications may be 
generated using methods which produce random or specific mutations or modifications. 
These mutations may, for example, change the length of, or alter the nucleotide 

25 sequence of, a loop, a spacer region or the recognition sequence (or domain). One or 

more mutations within one catalytically active enzymatic DNA molecule may be 
combined with the mutation(5) within a second catalyticdlly active enzymatic DNA 
molecule to produce a new enzymatic DNA molecule containing the mutations of both 
molecules. 

30 In other preferred embodiments, an enzymatic DNA molecule of the present 

Invention may have random mutations introduced Into it using a variety of methods well 
known to those skilled in the art. For example, the methods described by Cadwell and 

Joyce fPCR Methods and Applications 2 : 28-33 n992)) are particularly prefen-cd for use 
as disclosed herein, with some modifications, as described in the Examples that follow. 
35 (Also see Cadwell and Joyce, PGR Methods and Appl ications 3 (SudpH: SI 36-81 40 

(1994).) According to this modified PGR method, random point mutations may be 
Introduced into cloned genes. 
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The aforementioned methods have been used, for example, to mutegenlze genes 
encoding ribozymes with a mutation rate of 0.66% ± 0.13% 195% confidence interval) 
per position, as determined by sequence analysis, with no strong preferences observed 
with respect to the type ot base substitution. This allows the introduction of random 
5 mutations at any position in the enzymatic DNA molecules of the present invention. 

Another method useful In introducing defined or random mutations is disclosed 
in Joyce and Inoue. Mudeic Acids Research 17 ; 71 1-722 l^ 989). This latter method 
involves excision of a template (coding) strand of a double-stranded DNA, reconstruction 
of the template strand with inclusion of mutagenic oligonucleotides, and subsequent 
10 transcription of the partially-mismatched template. This allows the introduction of 

defined or random mutations at any position In the molecule by including 
polynucleotides containing Icnown or random nucleotide sequences at selected positions. 

Enzymatic DNA molecules of the present Invention may be of varying lengths 
and folding panerns, as appropriate, depending on the type and function of the 
1 5 molecule. For example, enzymatic DNA molecules may be about 1 5 to about 400 or 

more nucleotides In length, although a length not exceeding about 250 nucleotides is 
preferred, to avoid limiting the therapeutic usefulness of molecules by making them too 
large or unwieldy. In various preferred embodiments, an enzymatic DNA molecule of the 
present invention is at least about 20 nucleotides in length and, while useful molecules 
20 may exceed 100 nucleotides in length, preferred molecules are generally not more than 

about 100 nucleotides in length. 

In various therapeutic applications, enzymatic DNA molecules of the present 
Invention comprise the enzymaticafly active portions of deoxyriboiymes. In various 
embodiments, enzymatic DNA molecules of the present invention preferably comprise 
25 not more than about 200 nucleotides. In other embodiments, a daoxyrlbozyme of the 

present Invention comprises not more than about 100 nucleotides. In still other 
preferred embodiments, deoxyribozymes of the present invention are about 20-75 
nucleotides in length, more preferably about 20-65 nucleotides In length. Other 
preferred enzymatic DNA molecules are about 10-50 nucleotides In length. 
30 In other applications, enzymatic DNA molecules may assume configurations 

similar to those of -hammerhead- ribozymes. Such enzymatic DNA molecules are 
preferably no more than about 75-100 nucleotides in length, with a length of about 20- 
50 nucleotides being particularly preferred. 

In general, H one intends to synthesize molecules for use es disclosed herein, the 
35 larger the enzymatic nucleic acid molecule is. the more difficult it Is to synthesize. 

Those of skill in the art will certainly appreciate these design constraints. Nevertheless, 
such larger molecules remain within the scope of the present invention. 
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It is also to be understood that an enzymatic DNA molecule of the present 
invention may comprise enzymatically active portions of a deoxyribozyme or may 
comprise a deoxyribozyme with one or more mutations, e.g., with one or more base- 
pttlr-f orming sequences or spacers absent or modified, as long as such deletions, 
additions or modifications do not adversely Impact the molecule's ability to perform as 
an enzyme. 

The recognition domain of an enzymatic DNA molecule of the present Invention 
typically comprises two nucleotide sequences flanking a catalytic domain, and typically 
contains a sequence of at least about 3 to about 30 bases, preferably about 6 to about 
1 5 bases, which are capable of hybridizing to a complementary sequence of bases 
within the substrate nucleic acid giving the enzymatic DNA molecule its high sequence 
specificity. Modification or mutation of the recognition site via well-known methods 
allows one to alter the sequence specificity of an enzymatic nucleic acid molecule. 
(See, e.g, Joyce et al.. Nucleic Ariris Research 17; 711-712 (1989.)) 

Enzymatic nucleic acid molecules of the present invention also include those 
with altered recognition sites or domains. In various embodiments, these altered 
recognition domains confer unique sequence specificities on the enzymatic nucleic acid 
molecule including such recognition domains. The exact bases present in the 
recognition domain determine the base sequence at which cleavage will take place. 
Cleavage of the substrate nucleic acid occurs within the recognition domain. This 
cleavage leaves a 2\ 3\ or 2',3'-cyclic phosphate group on the substrate cleavage 
sequence and a 5' hydroxy! on the nucleotide that was originally immediately 3' of the 
substrate cleavage sequence in the original substrate. Cleavage can be redirected to a 
site of choice by changing the bases present in the recognition sequence (internal guide 
sequence). See Murphy et al.i Proc. Natl. Acad. Sci. USA 86 : 9218-9222 (1989), 
Moreover, it may be useful to add a polyamine to facilitate recognition and 
binding between the enzymatic DNA molecule and its substrate. Examples of useful 
polyamines include spermidine, putrescine or spermine. A spermidine concentration of 
about 1 mM may be effective in particular embodiments, while concentrations ranging 
from about 0.1 mM to about 10 mM may also be useful. 

In various alternative embodiments, an enzymatic DNA molecule of the present 
invention has an enhanced or optimized ability to cleave nucleic acid substrates, 
preferably RNA substrates. As those of skill in the art will appreciate, the rate of an 
enzyme-catalyzed reaction varies depending upon the substrate and enzyme 
concentrations and^ in general, levels off at high substrate or enzyme concentrations. 
Taking such effects into account, the kinetics of an enzyme-catalyzed reaction may be 
described in the following terms, which define the reaction. 
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■nie enhanced or optimized ability of an enzymatic DNA molecule of the present 
invention to cleave an RNA substrate may be determined in a cleavage reaction writh 
varying amounts of labeled RNA substrate in the presence of enzymatic DNA molecule. 
The ability to cleave the substrate is generally defined by the catalytic rate Ik^ divided 
5 by the Michaelis constant (KJ. The symbol represents the maximal velocity of an 
enzyme reaction wrhan the substrate approaches a saturation value. Km represents the 
substrate concentration at which the reaction rate is one-half maximal. 

For example, values for K^, and lc„, may be determined in this invention by 
experiments in which the substrate concentration ISl is in excess over enzymatic DNA 

10 molecule concentration [EJ. Initial rates of reaction (v,) over a range of substrate 

concentrations are estimated from the Initial linear phase, generally the first 5% or less 
of the reaction. Data points are fit by e least squares method to a theoreticel line given 
by the equation: v = -K„(v^|Sl) + V^. Thus. Ic« and K„ are determined by the initial 
rate of reaction, v.. and the substrate concentration [S). 

15 in various alternative embodiments, an enzymatic DNA molecule of the present 

Invention has an enhanced or optimized ability to cleave nucleic acid substrates, 
preferably RNA substrates. In preferred embodiments, the enhanced or optimized ability 
of an enzymatic DNA molecule to cleave RNA substrates shows about a 10- to lOMold 
improvement over the uncatalyzed rate. In more preferred embodiments, an enzymatic 

20 DNA molecule of the present invention Is able to cleave RNA substrates at a rate that is 

about 10»- to lOMold improved over -progenitor" species. In even more preferred 
embodiments, the enhanced or optimized abrtity to cleave RNA substrates is expressed 
as a 10*- to 10»-fold improvement over the progenitor species. One skilled in the art will 
appreciate that the enhanced or optimized ability of an enzymatic DNA molecule to 

25 cleave nucleic acid substrates may vary depending upon the selection constraints 

applied during the in vftro evolution procedure of the invention. 

Various preferred methods of modifying deoxyribozymes and other enzymatic 
DNA molecules and nucleases of the present invention are further described In Examples 

1-3 hereinbelow. 

30 C. M»rli»ntiHB Analogs 

As noted above, the term "nucleotide analog" as used herein generally refers to 
a purine or pyrimldine nucleotide that differs structurally from A, T. G, C, or U, but is 
sufficiently similar to substitute for such "normal- nucleotides In a nucleic acid molecule. 
As used herein, the term "nucleotide analog" encompasses altered bases, different (or 

35 unusual! sugars, altered phosphate backbones, or any combination of these alterations. 

Examples of nucleotide analogs useful according to the present invention include those 
Usted in the following Table, most of which are found In the approved listing of modified 
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bases at 37 CPR §1 .822 {which Is incorporated herelr> by reference). 

Table 1 

Nucleotide Analogs 



Abbrevtatwn 



PBScriPtipn 



10 



15 



20 



25 



30 



35 



ac4c 

chmSu 

cm 

cmnm5s2u 
d 

fm 
gaiq 
gm 
I 

16a 

mla 

mif 

mig 

mil 

m22g 

m2a 

m2g 

m3c 

m5c 

m6a 

m7g 

mamSu 

mam5s2u 

manq 

mcm5s2u 

mo5u 

ms2i6a 

ms2t6a 

mt6a 



4- acetylcytidine 

5- (carboxyhydroxylmethyl)uridine 
2'-0-methylcytldine 

5-carboxymethylaminomethy)-2-thiourldlne 

dihydrouridine 

2**0*methylpseudouridine 

D-gatactosytqueosine 
2*-0-methylguanosine 
Inosine 

N6-lsopentenyladenosine 
1-methyladenoslne 
1 -methyipseudouridina 
1 -methylguanosina 

1- methylinosine 
2,2-dlmethylguanosine 

2- rnethyladenosine 

2- methylguanosme 

3- methylcytidine 
5-methylcytldlne 
N6-methyladenosine 
7-mBthylguano5tne 
5-methylamlnomethyturidtne 
5-methoxyamlnomethyl-2-thiouridine 

D-mannosytmethyluridine 
5-methoxycarbonylmethylurjdlne 
S-methoxyuridine 

2-methytthto-N6-isopentenyladenosine 

N-((9-&-D-ribofuranosyl-2-mBthylthiopurine-6- 

yl)carbarnoyl)threonine 

N-((9-G-D-rlbofuranosy^urtne-6-yl)N-methyt- 

carbamoyDthreonine 
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(Table 1, cont'd) 



Ahbreviation 



Pcsgrtption 



10 



15 



20 



mv 

o5u 

osyw 

P 

q 

s2c 
s2t 
s2u 
54u 
t 

tea 

um 
yw 

X 

araU 
araT 



uridine- S-oxyacetic acid methylaster 

uridine- 5-oxyacetic acid (v) 

wybutoxosine 

pseudouridine 

queosine 

2-thiocytidin8 

5-nn ethyl- 2-t hi ourldine 

2- thloundine 

4- thiouridine 

5- methyluridine 

N-((9-B-D-ribofuranosylpurine-6-yl)carbamoyl)thr8onlnetm 

2'-0-meihyI-5-mBthyluridine 

2*-0-methvlurldlne 

wybutosine 

3- |3-amino-3-carboxypropyl)uridine, (acp3)u 
a« D-arabinosyt 

6, D-arabinosyt 



25 Other useful analogs Include those described in published International 

application no. WO 92/20823 (the disclosures of which are incorporated herein by 
reference), or analogs made according to the methods disclosed therein. Analogs 
described in DeMesmaeker, et al., Ann^w. Chem Int. Ed. Engl. 33: 226-229 11994); 
DeMesmaeker, et al., Svnlen : 733-736 {Oct. 1993); Nielsen, et al., SfflflnCB 2 54 : 1497- 

30 1 500 (1 991); and Idzlak, et al., TffTmhfTt"-" ' ''^^^^ ^4: 541 7-5420 (1 993) are also 

useful according to the within-disclosed Invention and said disclosures are Incorporated 

by reference herein. 

0. I^Pthods of E noinftftrina gnrYf"^^''^ D^A Motecuies 
The present invention also contemplates methods of producing nucleic acid 
35 molecules having a predetermined activity. In one preferred embodiment, the nucleic 

ecld molecule is an enzymatic DNA molecule. In another variation, the desired activity is 

a catalytic activity. 

In one embodiment, the present Invention contemplates methods of synthesizing 
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enzymatic DNA molecules that may then be "engineered** to catalyze a specific or 
predetermined reaction. Methods of preparing enzymatic DNA molecules are described 
herein; see, e.g.. Examples 1-3 herelnbelow. In other embodiments, an enzymatic DNA 
molecule of the present invention may be engineered to bind small molecules or ligands, 
such as adenosine triphosphate (ATP). (See, e.g., Sassanfar, et al.. Nature 364 : 550- 
553 11993).) 

fn another embodiment, the present invention contemplates tliat a population of 
enzymatic DNA molecules may be subjected to mutagenizing conditions to produce a 
diverse population of mutant enzymatic DNA molecules (which may alternatively be 
called "deoxyribozymes" or *DNAzymes"). Thereafter, enzymatic DNA molecules having 
desired characteristics are selected and/or separated from the population and are 
subsequently amplified. 

Alternatively, mutations may be introduced in the enzymatic DNA molecule by 
altering the length of the recognition domains of the enzymatic DNA molecule. The 
recognition domains of the enzymatic DNA molecule associate with a complementary 
sequence of bases within a substrate nucleic acid sequence. Methods of altering the 
length of the recognition domains are known in the art and include PGR, for example; 
useful techniques are described further in the Examples below. 

Alteration of the length of the recognition domains of an enzymatic DNA 
nfK>lecuie may have a desirable effect on the binding specificity of the enzymatic DNA 
molecule. For example, an increase In the length of the recognition domains may 
Increase binding specificity between the enzymatic DNA molecule and the 
complementary base sequences of an oligonucleotide in a substrate, or may enhance 
recognition of a panicular sequence in a hybrid substrate, in addition, an increase in the 
length of the recognition domains may also increase the affinity with which it binds to 
substrate. In various embodiments, these altered recognition domains in the enzymatic 
ONA molecule confer increased binding specificity and affinity between the enzymatic 
DNA molecule and its substrate. 

It has recently been noted that certain oligonucleotides are able to recognize and 
bind molecules other than oligonucleotides with complementary sequences. These 
oligonucleotides are often given the name "aptamers". For example, Ellington and 
Szostak describe RNA molecules that are able to bind a variety of organic dyes (liatUffi 
24£: 818-822 (1990)), while Bock, et al. describe ssDNA molecules that bind human 
thrombin ( Nature 355 : 564-566 (19921). Similarly, JeUinek, et al. describe RNA ligands 
to basic fibroblast growth factor ( PNAS USA 90: 11227-11231 (1993)). Thus, it Is 
further contemplated herein that the catalytlcaily active DNA enzymes of the present 
invention may be engineered according to the within-described methods to display a 



wo 96/17086 



PCTAJS95/155S0 



-21- 

variety of capabilities typically associated with aptamers. 

One of skill in the art should thus appreciate that the enzymatic DNA molecules 
of this invention can be altered at any nucleotide sequence, such as the recognition 
domains, by various methods disclosed herein, including PGR and 3SR (self-sustained 
5 sequence replication - see Example 1 below). For example, additional nucleotides can 

be added to the 5' end of the enzymatic DNA molecule by Including additional 
nucleotides In the primers. 

Enzymatic DNA molecules of the present invention may also be prepared or 
engineered In a more non-random fashion via use of methods such as site-directed 

10 mutagenesis. For example, site-directed mutagenesis may be carried out essentially as 

described in Morinaga, et al., Biotechnology 2: 636 (1984), modified as described 
herein, for application to deoxyribozymes. Useful methods of engineering enzymatic 
DNA molecules are further described in the Examples below. 

In one disclosed embodiment, an enzymatic DNA molecule of the present 

1 5 invention comprises a conserved core flanked by two substrate binding (or recognition) 

domains or sequences that Interact with the substrate through base-pairing interactions. 
In various embodiments, the conserved core comprises one or more conserved domains 
or sequences. In another variation, en enzymatic DNA molecule further comprises a 
"spacer" region (or sequence) between the regions (or sequences) involved in base 

20 pairing. In still another variation, the conserved core is "intermpted" at various intervals 

by one or more less-conserved variable or "spacer" nucleotides. 

In various embodiments, the population of enzymatic DNA molecules is made op 
of at least 2 dHf erent types of deoxyribozyme molecules. For example, in one variation, 
the molecules have differing sequences. In another variation, the deoxyribozymes are 

25 nucleic ocid molecules having a nucleic acid sequence defining a recognition domain that 

is contiguous or adjacent to the 5'-terminus of the nucleotide sequence. In various 
alternative embodiments, enzymatic DNA molecules of the present invention may further 
comprise one or more spacer regions located 3'-termlnal to the recognition domains, one 
or more loops located 3'-terminal to the recognition domains and/or spacer regions. In 

30 other variations, a deoxyribozyme of the present invention may comprise one or more 

regions which are capable of hybridizing to other regions of the same molecule. Other 
characteristics of enzymatic DNA molecules produced according to the presently- 
disclosed methods are described elsewhere herein. 

In other embodiments, mutagenlzing conditions include conditions that introduce 

35 either defined or random nucleotide substitutions within an enzymatic DNA molecule. 

Examples of typical mutagenizing conditions include conditions disclosed in other parts 
of this specification and the methods described by Joyce et al., Nlirl . AniftS RgS , 17 : 
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711-722 (1989); Joyce, ficne 82 : 83-87(1989); and Beaudry and Joyce, Sclanca 257 : 
635'41 (1992). 

tn still other embodiments, a diverse population of mutant enzymatic nucleic acid 
molecules of the present invention Is one that contains at least 2 nucleic add molecules 
5 that do not have the exact same nucleotide sequence. In other variations, from such a 

diverse population, an enzymatic DNA molecule or other enzymatic nucleic acid having a 
predetermined activity is then selected on the basis of its ability to perform the 
predetermined activity. In various embodiments, the predetermined activity comprises, 
without limitation, enhanced catalytic activity, decreased Km. enhanced substrate 

10 binding ability, altered substrate specificity* and the like. 

Other parameters which may be considered aspects of enzyme performance 
Include catalytic activity or capacity, substrate Mnding ability, enzyme turnover rate, 
enzyme sensitivity to feedback mechanisms, and the like. In certain aspects, substrate 
specificity may be considered an aspect of enzyme performance, particularly in 

15 situations in which an enzyme is able to recognize and bind two or more competing 

substrates, each of which affects the enzyme's performance with respect to the other 
substrate(s). 

Substrate specificity, as used herein, may refer to the specificity of an enzymatic 
nucleic add molecule as described herein for a particular substrate, such as one 

20 comprising ribonudeotides only, deoxyribonucleotides only, or a composite of both. 

Substrate molecules may also contain nucleotide analogs. In various embodiments, an 
enzymatic nucleic acid molecule of the present invention may preferentially bind to a 
particular region of a hybrid or non-hybrid substrate. 

The term or parameter identified herein as "substrate specificity" may also 

25 include sequence specificity; I.e., an enzymatic nucleic acid molecule of the present 

invention may "recognize" and bind to a nucleic acid substrate having a particular 
nucleic acid sequence. For example, if the substrate recognition domains of an 
enzymatic nucleic add molecule of the present Invention will only bind to substrate 
molecules having a series of one or two ribonudeotides (e.g., rA) in a row, then the 

30 enzymatic nucleic acid molecule will tend not to recognize or bind rujcteic acid substrate 

molecules lacking such a sequence. 

With regard to the selection process, in various embodiments, selecting indudes 
any means of physically separating the mutant enzyrinatic nucleic acids having a 
predetermined activity from the diverse population of mutant enzymatic nucleic acids. 

35 Often, selecting comprises separation by size, by the presence of a catalytic activity, or 

by hybridizing the mutant nucleic acid to another nucleic acid, to a peptide, or some 
other molecule that Is either in solution or attached to a solid matrix. 



WOM/17086 



PCT/US95/15580 



-23- 

In various embodiments, the predetermined activity is such that the mutant 
enzymatic nucleic add having the predetermined activity becomes labeled in some 
fashion by virtue of the activity. For example, the predetermined activity may be an 
enzymatic ONA molecule activity whereby the activity of the mutant enzymatic nucleic 
5 acid upon its substrate causes the mutant enzymatic nucleic acid to become covalently 

linked to it. The mutant enzymatic nucleic acid is then selected by virtue of the 
covalent linkage. 

In other embodiments, selecting a mutant enzymatic nucleic acid having a 
predetermined activity includes amplification of the mutant enzymatic nucleic acid (see, 
10 e.g.. Joyce, fipn^ 82: 83-87 (1 989); Beaudry and Joyce, f>fil(tnre 257 : 635-41 (1992)). 
Other methods of selecting an enzymatic nucleic acid molecule having a predatamiined 
characteristic or activity are described in the Examples section. 
E. r.nmpositions 

The invention also contemplates compositions containing one or more types or 
1 5 populations of enzymatic DNA molecules of the present invention; e.g.. different types 
or populations may recognize and cleave different nucleotide sequences. Compositions 
may further include a ribonucleic acid-containing substrate. Compositions according to 
the present invention may further comprise lead ion, magnesium ion. or other divalent or 
monovalent cations, as discussed herein. 
20 Preferably, the enzymatic DNA molecule is present at a concentration of about 

0.05 pM to about 2 iM. Typically, the enzymatic DNA molecule is present at a 
concentration ratio of enzymatic DNA molecule to subsuate of from about 1 :5 to about 
1:50. More preferably, the enzymatic ONA molecule is present in the composition at a 
concentration of about 0.1 to about 1 MM. Even more preferably, compositions 
25 contain the enzymatic DNA molecule at a concentration of about 0.1 to about 0.5 
^M. Preferably, the substrate is present in the composition at a concentration of about 

0.5 fAA to about 1000 /M. 

One skilled In the art will understand that there are many sources of nucleic 

acid-containing substrates including naturally-occurring and synthetic sources. Sources 
30 of suitable substrates include, without limitation, a variety of viral and retroviral agems, 

including HIV-1. HIV-2. HTLV-I, and HTLV-II. 

Other suitable substrates Include, without limitation, viral and retroviral agents 

including those comprising or produced by picornaviruses, hepadnaviridae (e.g., HBV, 

HCV), papillomaviruses (e.g.. HPV), gammaherpesvlrinae (e.g.. EBV). 
35 lymphocryptoviruses, leukemia viruses (e.g., HUV-I and -II). flavWiruses. togaviruses, 

herpesviruses (including alphaherpesvimses and betaherpesvln»as|. cytomegaloviruses 

,CMV) Influenza viruses, and viruses and retroviruses contributing to immunodeficiency 
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diseases and syndromes (e.g., HfV-1 and -2). In addition, suitable substrates include 
viral and retroviral agents which infect non-human primates and other animals including, 
without limitation, the simian end feline immunodeficiency viruses and bovine leukemia 
viruses. 

Magnesium ion, lead ion, or another suitable monovalent or dh^alent cation, as 
described previously, may also be present in the composition, at a concentration ranging 
from about 1-100 mM. More preferably, the preselected ion is present in the 
composition at a concentration of about 2 mM to about 50 mM, with a concentration of 
about 5 mM being particularly preferred. One skilled in the art will understand that the 
ion concentration is only constrained by the limits of solubility of its source (e.g. 
magnesium) in aqueous solution and a desire to have the enzymatic DMA molecule 
present In the same composition in an active conformation. 

The invention also contemplates compositions containing an enzymatic DNA 
molecule of the present invention, hybrid deoxyribonucleotide-ribonucleotide molecules, 
and magnesium or lead ion In concentrations as described hereinabove. As noted 
previously, other monovalent or divalent ions (e.g., Ca^*^) may be used in place of 
magnesium. 

Also contemplated by the present invention are compositions containing an 
enzymatic DNA molecule of the present invention, nucleic acid-containing substrate (e.g. 
RNAI, and a preselected Ion et a concentration of greater than about 1 milltmolar, 
wherein said substrate is greater in length than the recognition domains present on the 
enzymatic DNA molecule. 

In one variation, a composition comprises an enzymatic DNA molecule-substrate 
complex, wherein base pairing between an enzymatic DNA molecule and its substrate is 
contiguous. In another embodiment, base pairing between an enzymatic DNA molecule 
and Its substrate is interrupted by one or more noncomplementary pairs. In a variety of 
alternative embodiments, a composition of the present invention may further comprise a 
monovalent cation, a divalent cation, or both. 

In another variation, an enzymatic DNA molecule of the present invention is 
capable of functioning efficiently in the presence or absence of a divalent cation. In one 
variation, a divalent cation is present and comprises Pb^*, Mg^*, Mn'*, 2n'*, or Ca**. 
Alternatively, an enzymatic DNA molecule of the present invention is capable of 
functioning efficiently in the presence or absence of monovalent cations. It is 
anticipated that monovalent or divalent cation concentrations similar to those described 
herein for Pb'* or Mg^^ will be useful as disclosed herein. 

Optionally, monovalent cations may also t>e present in addition to, or as 
"alternatives" for, divalent cations. For example, monovalent cations such as sodium 
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(Na*) or potassium |K*) may be present, either as dissociated ions or in the form of 
dissociable compounds such as NaCI or KCL 

In one embodiment, the concentration of monovalent cation present In the 
composition ranges from 0 - 1 .0 M. In another embodiment, a monovalent cation is 
5 present In a concentration ranging from about 0-200 mM, In other embodiments, 

monovalent cations are present In a concentration ranging from about 1-100 mWI. 
Alternatively, the concentration of monovalem cations ranges from about 2 mM - 50 
mM. In still other embodiments, the concentration ranges from about 2 mM - 25 mM. 
F. Mftthods of Using EnzvrnffT'^ Molecules 
10 The methods of using enzymatic DNA molecules as disclosed herein are legion. 

As discussed previously, molecules capable of cleaving the bonds linking neighboring 
nucleic acids (e.g., phosphoester bonds) have numerous uses encompassing a wide 
variety of applications. For example, enzymatic DNA molecules having the wlthln- 
disclosed capabilities, stmctures. and/or functions are useful in pharmaceutical and 
1 5 medical products <e.g., for wound debridement, clot dissolution, etc.), as well as in 

household Items Je.g., detergents, dental hygiene products, meat tenderizers). Industrial 
utility of the withln-dlsclosed compounds, compositions and methods is also 
contemplated and well within the scope of the present Invention. 

The present Invention also describes useful methods for cleaving any single- 
20 stranded, looped, partially or fully double-stranded nucleic acid; the majority of these 

methods employ the novel enrymatically active nucleic acid molecules of the present 
Invention. In various embodimems, the slngle-stranded nucleic acid segment or portion 
of the substrate (or the entire substrate Itself) comprises DNA. modified DNA, RNA, 
modified RNA, or composites thereof. Preferably, the nucleic acid substrate need only 
25 be single-stranded at or near the substrate cleavege sequence so that an enzymatic 

nucleic acid molecule of the presem invention can hybridize to the substrate cleavage 
sequence by virtue of the enzyme's recognition sequence. 

A nucleic acid substrate that can be cleaved by a method of this invention may 
be chemically synthesized or enzymatlcally produced, or It may be isolated from various 
30 sources such as phages, viruses, prokaryotlc cells, or cukaryotic cells, including animal 

cells, plant cells, yeast cells and bacterial cells. Chemically synthesized single- and 
double-stranded nucleic acids are commercially avaHable from many sources including, 
without limitation. Research Genetics (Huntsville, AU'. 

RNA substrates may also be synthesized using an Applied Biosystems (Foster 
36 City, CA) oligonucleotide synthesizer according to the manufacturer's instmctions. 

Single stranded phage are also a source of nucleic acid substrates. (See, e.g.. Messing 
et al., PMAS USA 74 : 3642-3646 (1977), and Yanlsch- Perron et al., Q^Q&Jl: 103-119 
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(1985).) Bacterial cells containing sing(e*stranded phage would also be a ready source 
of suitable single-stranded nucleic acid substrates. 

Single-stranded RNA cleavable by a method of the present invention could be 
provided by any of the RNA viruses such as the picorna viruses, togaviruses, 
5 orthomyxoviruses, paramyxoviruses, rhabdoviruses, coronavlruses, arenaviruses or 

retroviruses. As noted previously, a wide variety of prokaryotlc and eukaryotic cells 
may also be excellent sources of suitable nucleic acid substrates. 

The methods of this invention may be used on single-stranded nucleic acids or 
single-stranded portions of looped or double-stranded nucleic acids that are present 

1 0 inside a cell, including eukaryotic, procaryotic, plant, animal, yeast or bacterial cells. 

Under these conditions an enzymatic nucleic acid molecule (e.g., an enzymatic DNA 
molecule or deoxyribozyme) of the present invention could act as an anti-viral agent or a 
regulator of gene expression. Examples of such uses of enzymatic DNA molecules of 
the present invention are described further hereinbelow. 

15 In the majority of methods of the present invention, cleavage of single-stranded 

nucleic acids occurs at the 3'-terminus of a predetermined base sequence. This 
predetermined base sequence or substrate cleavage sequence typically contains from 1 
to about 10 nucleotides. In other preferred embodiments, an enzymatic DNA molecule 
of the present invention is able to recognize nucleotides either upstream, or upstream 

20 and downstream of the cleavage site. In various embodiments, an enzymatic DNA 

molecule is able to recognize about 2-10 nucleotides upstream of the cleavage site; In 
other embodiments, an enzymatic DNA molecule is able to recognize about 2-10 
nucleotides upstream and about 2-10 nucleotides downstream of the cleavage site. 
Other preferred embodiments contemplate an enzymatic DNA molecule that is capable 

25 of recognizing a nucleotide sequence up to about 30 nucleotides in length, with a length 

up to about 20 nucleotides being even more preferred. 

The within-disclosed methods allow cleavage at any nucleotide sequence by 
altering the nucleotide sequence of the recognition domains of the enzymatic DNA 
molecule. This allows cleavage of single-stranded nucleic ecid in the absence of a 

30 restriction endonuclease site at the selected position. 

An enzymatic DNA molecule of the present invention may be separated from any 
ponlon of the single-stranded nucleic acid substrate that remains attached to the 
enzymatic DNA molecule by site-specific hydrolysis at the eppropriate cleavage site. 
Separation of the enzymatic DNA molecule from the substrate (or "cleavage product") 

35 allows the enzymatic DNA molecule to carry out another cleavage reaction. 

Generally, the nucleic acid subsuate is treated under appropriate nucleic acid 
cleaving conditions - preferably, physiologic conditions - with an effective amount of 
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an enzymatic DNA molecule of the present invention. If the nucleic acid substrate 
comprises DNA, cleaving conditions may include the presence of a divalent cation at a 
concentration of about 2-1 OmM. 

An effective amount of an enzymatic DNA molecule is the amount required to 
5 cleave a predetermined base sequence present within the single-stranded nudeic acid. 

Preferably, the enzymatic DNA molecule Is present at a molar ratio of DNA molecule to 
substrate cleavage sites of 1 to 20. This ratio may vary depending on the length of 
treating and ef fidency of the particular enzymatic DNA molecule under the particular 
nucleic acid deavage conditions employed. 

10 Thus, in one preferred embodiment, treating typically involves admixing, in 

aqueous solution, the RNA-containing substrate and the enzyme to form a deavage 
admixture, and then maintaining the admixture thus formed under RNA cleaving 
conditions for a time period sufficient for the enzymatic DNA molecule to deave the 
RNA substrate at any of the predetermined nudeotide sequences present in the RNA. In 

1 5 various embodiments, a source of Ions Is also provided - i.e. monovalent or divalent 

cations, or both. 

In one embodiment of the present invention, the amount of time necessary for 
the enzymatic DNA molecule to cleave the single-stranded nucleic acid has been 
predetermined. The amount of time is from about 1 minute to about 24 hours and will 

20 vary depending upon the concentration of the reactants and the temperature of the 

reaction. Usually, this time period Is from about 10 minutes to about 2 hours such that 
the enzymatic DNA molecule deaves the single-strended nudeic acid at any of the 
predetermined nudeotide sequences present. 

The invention further contemplBtes that the nucleic acid deaving conditions 

25 include the presence of e source of divalent cations (e.g., PbOAc) at a concentration of 

about 2-1 00 mM. Typically, the nudeic acid deaving conditions indude divalent cation 
at a concentration of about 2 mM to about 10 mM, with a concentration of about 5 mM 
being panlcularly preferred. 

The optimal cationic concentration to Include In the nudeic add cleaving 

30 conditions can be easily determined by determining the amount of single-stranded 
nucleic acid deaved at a given cation concentration. One sicilled in the art will 
understand that the optimal concentration may vary depending on the particular 
enzymatic DNA molecule employed. 

The present invention further contemplates that the nucleic add deaving 

35 conditions Include a pH of about pH 6.0 to about pH 9.0. In one preferred embodiment, 

the pH ranges from about pH 6.5 to pH B.O. In another preferred embodiment, the pH 
emulates physiological conditions. I.e., the pH Is about 7.0-7.8. with a pH of about 7.5 
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being particularly preferred. 

One skilled in the art will appreciate that the methods of the present invention 
will work over a wide pH range so fong as the pH used for nucleic acid cleaving is such 
that the enzymatic DNA molecule Is able to remain in an active conformation. An 
5 enzymatic DNA molecule in an active conformation Is easily detected by its ability to 

cleave single-stranded nucleic acid at a predetermined rHJcleotide sequence. 

In various embodiments, the nucleic acid cleaving conditions also include a 
variety of temperature ranges. As noted previously, temperature ranges consistent with 
physiological conditions are especially preferred, although temperature ranges consistent 

10 with industrial applications are also contemplated herein. In one embodiment, the 

temperature ranges from about 1 S'^C to about 60''C. In another variation, the nucleic 
acid cleaving conditions include a temperature ranging from about 30*C to about 56*C. 
In yet another variation, nucleic acid cleavage conditions include a temperature from 
about ZB^'C to about 50*C. In a preferred embodiment, nucleic acid cleavage conditions 

1 5 comprise a temperature range of about 37'C to about 42*^0. The temperature ranges 
consistent with nucteic acid cleaving conditions are constrained only by the desired 
cleavage rate and the stability of that particular enzymatic DNA molecule at that 
particular temperature. 

In various methods, the present invention contemplates nucleic acid cleaving 

20 conditions including the presence of a polyamine. Polyamtnes useful for practicing the 

present invention include spermidine, putrescine, spermine and the like. In one 
variation, the polyamine is present at a cortcentratlon of about .01 mM to about 10 mM. 
In another variation, the polyamine Is present at a concentration of about 1 mM to about 
10 mM. Nucleic acid cleavage conditions may also include the presence of polyamine at 

25 a concentration of about 2 mM to about 5 mM. In various preferred embodiments, the 

polyamine is spermidine. 
G. Vectors 

The present invention also features expression vectors Including a nucleic acid 
segment encoding an enzymatic DNA molecule of the present invention situated within 
30 the vector, preferably in a manner which allows expression of that enzymatic DNA 

molecule within a target cell (e.g., a plant or animal cell). 

Thus, In general, a vector according to the present Invention preferably Includes 
a plasmid, cosmid, phagemid, virus, or phage vector. Preferably, suitable vectors 
comprise single-stranded DNA (ssDNAI - e.g., circular phagemid ssDNA. h should also 
35 be appreciated that useful vectors according to the present invention need not be 

circular. 

In one variation, nucleotide sequences flanking each of the additional enzymatic 
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DNA molecule-encoding sequences are preferably provided, which sequences may be 
recognized by the first enzymatic DNA molecule. The intervening or flanking sequences 
preferably comprise at least 1 nucleotide; more preferably, intervening or flanking 
sequences are about 2-20 nucleotides In length, with sequences of about 5-10 
nucleotides in length being particularly preferred. 

The addition of polynucleotide tails may also be useful to protect the 3* end of 
an enzymatic DNA molecule according to the present invention. These may be provided 
by attaching a polymeric sequence by employing the enzyme terminal transferase. 

A vector according to the present invention includes two or more enzymatic 
DNA molecules. In one embodiment, a first enzymatic DNA molecule has intramolecular 
cleaving activity and is able to recognize and cleave nucleotide sequences to release 
other enzymatic DNA sequences; i.e., it is able to function to "release" other enzymatic 
DNA molecules from the vector. For example, a vector is preferably constructed so that 
when the first enzymatic DNA molecule is expressed, that first molecule is able to 
cleave nucleotide sequences flanking additional nucleotide sequences encoding a second 
enzymatic DNA molecule, a third enzymatic DNA molecule, and so forth. Presuming 
said first enzymatic DNA molecule (i.e., the "releasing" molecule) is able to cleave 
oligonucleotide sequences intramolecularly, the additional (e.g. second, third, and so on) 
enzymatic DNA molecules (i.e., the "released" molecules) need not possess 
choracteristics identical to the "releasing" molecule. For example, in one embodiment, 
the -released- (i.e., the second, third, etc.) fenzymatic DNA molecules are able to cleave 
spedfic RNA sequences, while the first {-releasing-) enzymatic DNA molecule has 
nuclease activity allowing it to Uberate the -released" molecules. In another 
embodiment, the "released" enzymatic DNA molecule has amide bond-cleaving activity, 
while the first ("releasing") enzymatic DNA molecule has nucleese activity. 

Alternatively, the first enzymatic DNA molecule may be encoded on a separate 
vector from the second (and third, fourth, etc.) enzymatic DNA moleculels) and may 
have intcrmolecular cleaving activhy. As noted herem, the first enzymatic DNA 
molecule can be a self-cleaving efuymatic DNA molecule {e.g., a deoxyribozyme), and 
the second enzymatic DNA molecule may be any desired type of enzymatic DNA 
molecule. When a vector is caused to express DNA from these nucleic acid sequences, 
tiiat DNA has the ability under appropriate conditions to cleave each of the flanking 
regions, thereby releasing one or more copies of the second enzymatic DNA molecule. 
If desired, several different second enzymatic DNA molecules can be placed in the same 
cell or carrier to produce different deoxyribozymes. It Is also contemplated that any one 
or more vectors may comprise one or more ribozymea or deoxyribozymes in any 
combination of "releasing- and "released" enzymatic nucleic acid molecules, as long as 
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such a combination achieves the desired result: the release of enzymatic nucleic acid 
molecules that are capable of cleaving predetermined nucleic acid sequences. 

Methods of isolating and purifying enzymatic DMA molecules of the present 
invention are also contemplated. In addition to the methods described herein, various 
5 purification methods <e.g. those using HPLC) and chromatographic isolation techniques 

are available in the art. See, e.g., the methods described in published international 
application no. WO 93/23569, the disclosures of which are incorporated herein by 
reference. 

It should also be understood that various combinations of the embodiments 
10 described herein are included within the scope of the present invention. Other features 

and advantages of the present invention will be apparent from the descriptions 
hereinabove, from the Examples to follow, and from the claims. 



15 The following examples Illustrate, but do not limit, the present Invention, 

Example 1 

In Vifro Evolution of Enzvmatic DMA Molecules: 

An PYgrvi^vY 

In vitro selection and in vitro evolution techniques allow new catalysts to be 

20 isolated without a priori knowledge of their composition or structure. Such methods 

have been used to obtain RNA enzymes with novel catalytic properties. For example, 
ribozymes that undergo autolytic cleavage with lead cation have been derived from a 
randomized pool of tRNA^ molecules <Pan and Uhlenbeck, Binchemistrv 31: 3887-3895 
(19921). Group I ribozyme variants have been isolated that can cleave DNA (Beaudry 

25 and Joyce, Science 257 : 635-641 (1992)) or that have altered metal dependence 

(Lehman and Joyce, Nature 361 : 182-186 (1993)). Starling with a pool of random RNA 
sequences, molecules have been obtained that catalyze a polymerase-like reaction 
(Bartel and Szostak, Science 261 : 141 M418 (1993)1. In the present example, 
refinement of specific catalytic properties of an evolved enzyme via alteration of the 

30 selection constraints during an in vitro evolution procedure Is described. 

Darwinian evolution requires the repeated operetion of three processes: (a) 
introduction of genetic variation; (b) selection of individuals on the basis of some fitness 
criterion; and (c) amplification of the selected Individuals. Each of these processes can 
be realized in vitro (Joyce, Gene 82 : 83 (1989)). A gene can be mutagenlzed by 

35 chemical modification, incorporation of randomized mutagenic ollgodeoxynudeotides, or 

inaccurate copying by a polymerase, (See, e.g.* Cadwell and Joyce, In PCR MgthPte 
and APDlications 2 : 28-33 (1992); Cadwell and Joyce, pfy Methods and Applications 3 
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«Stiopl.h S136-S140 n 994); Chu. et al., Virotoav 98: 168 (1979); Shortle. et al., MfiTtk 
FnrvmHl. 10O : 457 (19B3); Myers, et al.. Sri^nri. ??9: 242 (1985); MattBUCCi. et al., 
M..riBin Acldii Res. 11 ; 3113 (1983); Wells, et al.. fifing 34: 315 (1985): IVIcNeil, et al., 
Mni rsll. Biol. B : 3545 (1985); Hutchison, et al., PNftff l/SA 83: 710 11986); 
5 Derbyshire, et al., Gene 46 : 146 (1986); Zakour, et al., NfTli i re 2 95 : 708 11982); 

Lehtovaara. et al.. Protein Eno. 2 ; 63 (1988); Leung, et al., TWihniflue 1: 11 11989); 
Zhou, et a!.. H'r' ■^^'^^ ^"-^ ^052 (1991).) 

The gene product can be selected, for example, by Its abilty to bind a ligand or 
to carry out a chemical reaction. (See. e.g.. Joyce, JiL (1989); Robertson end Joyce, 

10 MatiifB 344 : 467 (1990); Tuerk. et al.. Sflience 249: 605 (1990).) The gene that 
corresponds to the selected gene product can be amplified by a reciprocal primer 
method, such as the polymerase chain reaction (PCR). (See, e.g., Saiki, et al., SfiifiOCfi 
230: 1350-54 (1985); Saiki, et al.. Science 239: 487-491 (1988).) 

Alternatively, nucleic acid amplification may be carried out using self-sustained 

1 5 sequence replication (3SR), (See. e.g., Guatelli. et al.. PNftSVSAS?: 1874 (1990). the 

disclosures of which are incorporated by reference herein.) According to the 3SR 
method, target nucleic acid sequences may be ampfified (replicated) exponentially bt 
vitro under isothermal conditions by using three enzymatic activities essential to 
retroviral replication: (1) reverse transcriptase. (2) RNase H. and (3) a DNA-dependent 

20 RNA polymerase. By mimicking the retroviral suategy of RNA replication by means of 

cDNA intermediates, this reaction accumulates cDNA and RNA copies of the original 
target. 

In summary, if one is contemplating the evolution of a population of enzymatic 
DNA molecules, a continuous series of reverse transcription and transcription reactions 
25 replicates an RNA target sequef.ce by means of cDNA intermediates. The crucial 

elements of this design are (a) the oligonucleotide primers both specify the target and 
contain 5' extensions encoding the T7 RNA polymerase binding site, so that the 
resultant cDNAs are competent transcription templates; (b) cDNA synthesis can proceed 
to completion of both strands due to the degradation of template RNA in the 

30 intermediate RNA-DNA hybrid by RNase H; and (c) the reaction products (cDNA and 

RNA) can function as templates for subsequent steps, enabling exponential replication. 

If one is evolving enzymatic DNA molecules, various critical elements of this 
design are somewhat different, as disclosed in those Examples. For instance. (1) the 
oligonucleotide primers specify the target and are preferabh^ 'marked' or labeled in 

35 some fashion - e.g., via biotinylation - so the resultant competent templete strands are 

easily Identified; and (21 the m vitro selection procedure used preferably depends upon 
the identification of the most favorable release mechanism. 
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A major obstacle to realizing Darwinian evolution In vitro Is the need to integrate 
mutation and amplification, both of which are genotype-related, with selection, which Is 
phenotYpe*related. In the case of nucleic acid enzymes, for which genotype and 
phenotype are embodied in the same molaculer the task is simplified. 
5 A. Design of Enzymatic PNA MqIccuIbs 

It is well icnown that single-stranded DNA can assume interesting tertiary 
structures. The structure of a "tDNA", for example, closely resemt>les that of the 
corresponding tRNA. fSee Paquette, et aL, Ftir. J. Biochem. 189: 259-265 (1990).) 
Furthermore, it has been possible to replace as many as 31 of 35 ribonucleotides within 

10 a hammerhead ribozyme, while retaining at least some catalytic activity. (See Perreault, 

et aL, Nature 344 : 565-567 (1990); Williams, et aK, Prq^. Natl. Arad- Sci. USA B9: 
918-921 (1992); Yang, et al., BiochBmistrv 31 : 5005-5009 (1992).) 

In vitro selection techniques have been applied to large populations of 
random-sequence DNAs, leading to the recovery of specific DNA 'aptamers' that bind a 

1 5 target ligand with high affinity (Bock, et al.. Nature 355 ; S64-566 (1 992); Ellington & 

Szostak. Nature 355 : 850-852 (1992); Wyatt & Ecker, PNAS USA 91: 1356-1360 
(1994)). Recently, two groups earned out the first NMR structural determination of an 
aptamer, a 1 5mer DNA that forms a G-quartet structure and binds the protein thrombin 
with high affinity (Wang, et aL, Binnhemistrv 32 : 1899-1904 (1993); Macaya, et al., 

20 PNAS USA 90 : 3745-3749 (1993)), These findings were corroborated by an X-ray 

crystallographic analysis (Padmanabhan, et al., J. BloL Chcm. 268: 17651-17654 
(1993)). 

The ability to bind a substrate molecule with high affinity and specificity is a 
prerequisite of a good enzyme. In addition, an enzyme must make use of 

25 well-positioned functional groups, either within itself or a cofactor, to promote a 

particular chemical transformation. Furthermore, the enzyme must remain unchanged 
over the course of the reaction and be capable of operating with catalytic turnover. 
Some would add the requirement that it be an informational macromolecule, comprised 
of subunits whose specific ordering Is responsible for catalytic activity. While these 

30 criteria are open to debate on both semantic and chemical grounds, they serve to 

distinguish phenomena of chemical rate enhancement that range from simple solvent 
effects to biological enzymes opereting at the limit of substrate diffusion (Albery & 
Knowles, pinchemistrv 15 : 5631-5640 (1976)). 

As described in greater detail hereinbelow, we sought to develop a general 

35 method for rapidly obtaining DNA catalysts and DNA enzymes, starting from random 

sequences. As an initial target, we chose a reaction that we felt was well within the 
capability of DNA: the hydrolytic cleavage of an RNA phosphodiester, assisted by a 
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divalem metal cof actor. TWs is tha same reaction that is carried out by a variety of 
naturallv-occurring RNA enzymes, including the hammerhead and hairpin motifs. (See, 
e.g., Forster A.C. & Symons R.H.. CpA\ 49 = 21 1-220 (1987); Uhlenbeck. UsmL32&-- 
596-600 (1987); Hampel & Tritz, pinrhBmistrv 28: 4929-4933 (1989)). 
5 It has recently been shown that, beginning with a randomized library of tRNA 

molecules, one can obtain ribozymesthat have Pb»*-dependent. slte-spaolfic RNA 
phosphoesterase activity at neutral pH (Pan & Uhlenbeck. ffinrHftinlSTrV 31= 3887-3895 
(1992): Pan & Uhlenbeck. NaOttfi^: 560-563 (1992)). This Is analogous to the 
fortuitous self-cleavage reaction of yeast tRNA'^ (Dirheimer 8. Werner. Blnchlmie 54 = 

10 127-144 (1972)). wrhich depends on specific coordination of a Pb'» ion at a defined site 

within the tRNA. (See Rubin & Sundaralingam, | PtPmftl f^Tnifit. PYn . 1 = 639-646 
(1983): Brown, et a!.. RinrhBmistrv 24; 4785-4801 (1985).) 

AS disclosed herein, our goals included the development of DMAs that could 
carry out Pb»*-dependent cleavage of a particular RNA phosphoester. initially presented 

15 within a short leader sequence attached to the 5' end of the DNA, and ultimately 

located within a separate molecule that could be cleaved in an intermolecular fashion 
with rapid catalytic turnover. These goals were succeasf u»y achieved, as described 
further below. 

No assumptions were made as to how the DNA would Interact with the target 
20 phosphoester and surrounding nucleotides. Beginning with a pool of approximately 10" 
random 50mer sequences, in vitro selection was allowed to run its course. After five 
rounds Of selection carried out over four days, the population as a whole had attained 
the ability to cleave the target phosphoester in the presence of 1 mM Pb»* at a rate of 
about 0.2 mln•^ This is an approximately 10»-fold Increase compared to the 
25 spoirtaneous rate of cleavage under the same reection conditions. 

Individuals were Isolated from the population, sequenced, and assayed for 
catalytic activity. Based on this information, the reaction was converted to an 
intermolecular format and then simplified to allow site-specific cleavage ol a ISmer 
substrate by a 38mer DNA enzyme. In a reaction that proceeds with a turnover rate of 1 
30 mln ' at 23'C and pH 7.0 in the presence of 1 mM PbOAc. 

B. In yjHrn saiertion Scheme 

A starting pool of approximately 10- single-stranded DNA molecules was 
ganerated, all of which contain a 5' biotin moiety, followed successively by a fixed 
domain that includes a single ribonucleotide, a potentiel catalytic domain compr«ed of 
50 random deoxyribonucleotides, and a second fixed domain that lay at the 3' temtmus 
(Fig. 1). 

The pool was constructed by a nested PGR (polymerase chain reaction) 



35 



wo 96/17096 



PCTAJS9S/15S80 



-34- 

technique, beginning with synthetic DIMA that contained 50 random nucleotides flanked 
by primer binding sites. The nested PCR primer was a 5'-blotinyiated synthetic 
oligodeoxynucleotide with a 3'-terminal adenosine ribonucleotide. 
Ribonucleottde-terminated oligonucleotides efficiently prime temp)ate*directed elongation 
5 in the context of the PCR (L.E. Orgel, personal communication), in this case giving rise 

to an extension product that contains a single embedded ribonucleotide. 

Figure 1 illustrates a selective amplification scheme for isolation of DNAs that 
cleave a target RNA phosphoester. Double-stranded DNA containing a stretch of 50 
random nucleotides is amplified via PCR, employing a 5'-biotlnylated DNA primer (e.g., 

10 primer 3 - 3a or 3b) terminated at the 3' end by an adenosine ribonucleotide 

(represented by the symbol "N" or "rA", wherein both N and rA represent an adenosine 
ribonucleotide). This primer is extended by Tag polymerase to yield a DNA product that 
contains a single embedded ribonucleotide. The resulting double-stranded DNA Is 
immobilized on a streptavidin matrix and the unbiotinylated ONA strand is removed by 

1 5 washing with 0.2 N NaOH. After re-equilibrating the column with a buffered solution, 

the column is washed with the same solution with added 1 ml^ PbOAc. DNAs that 
undergo Pb^^ -dependent self-cleavage are released from the column/ collected in the 
eluant, and amplified by PCR. The PCR products are then used to initiate the next round 
of selective amplification. 

20 The PCR products were passed over a streptavidin affinity matrix, resulting In 

noncovalent attachment of the 5'-biotinylated strand of the duplex DNA. The 
nonbiotinylated strand was removed by brief weshing with 0.2 N NaOH, and the bound 
strand was equilibrated in a buffer containing 0.5 M NaCI, 0.5 M KCl, 50 mM MgClj, 
and 50 mM HEPES (pH 7.0) at 23*C. Next, 1 mM PbOAc was provided in the same 

25 buffer, allowing Pb^* -dependent cleavage to occur at the target phosphoester, thereby 

releasing a subset of the DNAs from the streptavidin matrix. In principle, an individual 
DNA might facilitate its own release by various means, such as disruption of the 
interaction between biotin and streptavidin or cleavage of one of the 
deoxyribonucleotide linkages. It was felt that cleavage of the ribonucleoside 3'-0-F 

30 bond would be the most likely mechanism for release, based on the relative lability of 

this linkage, and that Pb'*-dependent hydrolytic cleavage would allow release to occur 
most rapidly. In principle, however, the in vitro selection procedure should identify the 
most favorable release mechanism as well as those individuals best able to carry out 
that mechanism. 

35 DNA molecules released from the matrbc upon addition of Pb''' were collected In 

the eluant, concentrated by precipitation with ethanol, and subjected to nested PCR 
amplification. As In the construction of the starting pool of molecules, the first PCR 



WO9</17086 



PCTAIS95/15SS0 



-35- 

amplification utilized primers that flank the random region (primers 1 and 2) and the 
second utilized a B'-biotinvlated primer (primer 3b) that has a 3 --terminal riboadenylate, 
thereby reintroducing the target RNA phosphoester. The entire selective amplification 
procedure requires 3-4 hours to perform. 
5 The molecules are purified in three ways during each round of this procedure: 

first, following PCR amplification, by extracting twice with phenol and once with 
chloroform / isoamyl alcohol, then precipitating with ethanolj second, following 
attachment of the DNA to streptavidln. by washing away all the nonbiotinylated 
molecules under strongly denaturing conditions; and third, following elution with Pb» 
1 0 by precipitating with ethanol. There is no gel electrophoresis purification step, and thus 
no selection pressure constraining the molecules to a particular length. 
C. fiPlPrtinn of Catalytic DNA 

We carried out five successive rounds of in vitro selection, progressivehr 
decreasing the reaction time following addition of Pb»* in order to progresshrely increase 
1 5 the stringency of selection. During rounds 1 though 3. the reaction time was 1 hour; 

during round 4. the reaction time was 20 minutes; and during round 5. it was 1 minute. 
The starting pool of slngle-strandad DNAs. together with the population of molecules 
obtained after each round of selection, was assayed for self-cleavage activity under 
conditions identical to those employed during in vitro selection (see Fig. 2). 
20 For this essay, the molecules were prepared with a 5'-«P rather than a S'-biotin 

moiety, allowing detection of both the starting material and the 5' cleavage product. 
Following a 5-mlnute incubation, there was no detectable activity in the initial pool (GO) 
or in the population obtained after the first and second rounds of selection. DNAs 
obtained after the third round IG3) exhibited a modest level of activity; this activity 
25 Increased steadily, reaching approximately 50% self-cleavage for the DNAs obtained 
after the fifth round of selection (Q5). Cleavage was detected only at the target 
phosphoester, even after long incubation times. This activity was lost if Pb»» was 
omitted from the reaction mixture. 

Figure 2 Mlustrates the self-cleavage activity of the starting pool of DNA (GO) 
30 and populations obtained after the first through fifth rounds of selection (01 - G5). 

Reaction mixtures contained 50 mM MgCI,, 0.5 M NaCI, 0.5 M KCI, 50 mM HEPES (pH 
7.0 Bt 23*C), and 3 nM I5'-"P)-l8beled DNA, incubated at 23'C for 5 min either in the 
presence or in the absence of 1 mM PbOAc. The symbol Pre represents 108-nucleotide 
precursor DNA (SEQ ID NO 4); Civ, 28.nucleotide 5'-cleavage product (SEQ ID NO 5); 
35 and M, primer 3a (SEQ ID NO 6), corresponding In length to the B'-deavage product. 

The 28^njcleotide 5' cleavage product (Chr) Illustrated preferably has the 
sequence 5--6GGACGAATTCTAATACGACTCACTATN-3'. wherein "N" represents 
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Bdenosine ribonucleotide with an additional 2% 3'-cyclic phosphate on the 3* end (SEQ 
ID NO 5). In alternative embodiments, "N" represents adenosine ribonucleotide with an 
additional 2' or 3* phosphate on the 3* end of the molecule. 

In Figure 2, the "GO" lane "Pre" band comprises a sampling of 10B-nucleotide 
5 precursor DNAs that eech include 50 random nucleotides. Therefore, any given "Pre" 

sampling win contain a wide variety of precursor DNAs, and each sampling will likely 
differ from previous and subsequent samplings. The "Gl" through "G5* lanes contain 
*Pre" bands that are increesingiy enriched for catalytic DNA molecules, but still contain 
a large number of different DNA sequences (i.e., differing in the 50 nucleotide 

10 randomized domain). A sample of these different sequences from "G5 Pre" DNA Is 

provided in Figure 3. 

Shotgun cloning techniques were employed to Isolate individuals from the G5 
population; the complete nucleotide sequences of 20 of these subclones were then 
determined (see Fig. 3). (Also see, e.g., Cadwell and Joyce, in PGR Methods and 

1 5 ADPllMtignS 2: 28-33 (1 992); Cadwell end Joyce, PGR Methods and Applications 3 

ISumlii: S136-S140 (1994).) Of the 20 sequences, five were unique, two occurred 
twice, one occurred three times, and one occurred eight times. All of the individual 
variants share common sequence elements within the 50-nuc!eotlde region that had 
been randomized in the starting pool of ONA. They all contain two presumed template 

20 regions, one with complementarity to a stretch of nucleotides that lies just upstream 

from the cleavage site and the other with comptementarhy to nucleotides that lie at 
least four nucleotides downstream. Between these two presumed template regions lies 
a variable domain of 1-11 nucleotides, followed by the fixed sequence 5*-AGCG*3', then 
a second variable domain of 3-8 nucleotides, and finally the fixed sequence 5*-CG*3* or 

25 5*-CGA-3'. Nucleotides that lie outside of the two presumed template regions are highly 

variable in both sequence and length. In all of the sequenced subclones, the region 
corresponding to the 50 Initially-randomized nucleotides remains a total of 50 
nucleotides in length. 

Figure 3 illustrates the sequence alignment of individual variants isolated from 

30 the population after five rounds of selection. The fixed substrate domain (5*- 

GGGACGAATTCTAATACGACTCACTATrAGGAAGAGATGGCQAC-3', or 5*- 
GGGACGAATTCTAATACGACTCACTATNGGAAGAGATGGCGAC-3\ where N represents 
adenosine ribonucleotide) (SEQ ID NO 13) is shown at the top, with the target 
riboadenylate identified with an Inverted triangle. Substrate nucleotides that are 

35 commonly involved in presumed base-pairing Interactions ere indicated by a vertical bar. 

Sequences corresponding to the 50 initially-randomized nucleotides are aligned 
antiparallel to the substrate domein. All of the veriants are 3'-terminated by the fixed 
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sequence 5'-CGGTAAGCTTGGCAC-3' ISEQ ID NO 1) Tprimcr site"; not shown). 
Nucleotides within the initially-randomized region that are presumed to form base pairs 
with the substrate domain are indicated on the right and left sides of the Figure; the 
putative base-pair-forming (or substrate binding) regions of the enzymatic DNA 
molecules are Individually boxed in each sequence shown. The highly-conserved 
nucleotides within the putative catalytic domain are illustrated In the two boxed 
columns. 

While it is anticipated that additional data will be helpful in constructing a 
meaningful secondary structural model of the catalytic domain, we note that, like the 
hammerhead and hairpin ribozymes, the catalytic domain of our enzymatic DNA 
molecules appears to contain a conserved core flanked by two substrate binding regions 
(or recognition domains) that interact with the substrate through base-pairing 
Interactions, Similar to the hammerhead and hairpin ribozymes, the catalytic DNAs also 
appear to require a short stretch of unpaired substrate nucleotides - In this case 
5'-GQA-3* between the two regions that are Involved In bese pairing. 

It was also interesting to note that each of the nine distinct variants exhibited a 
different pattern of presumed complementarity with the substrate domain. In some 
cases, base pairing was contiguous, while in others it was interrupted by one or more 
noncomplementery pairs. The general tendency seems to be to form tighter Interaction 
with the nucleotides that lie upstream from the cleavage site compared to those that lie 
downstream. Binding studies and site-directed mutagenesis analysis should enable us to 
gain further insights and to further substantiate this conjecture. 

In order to gain further insight into the sequence requirements for catalytic 
function, the self-cleavage activity of six of the nine variants was tested and evaluated 
under the withln-described selection conditions (see Fig. 3). Not surprisingly, the 
sequence that occurred In eight of the 20 subclones proved to be the most reactive, 
with a first-order rate constant of 1.4 m\n \ AH of the studied variants were active In 
the self-cleavage assay and all gave rise to a single 5'-labeled product corresponding to 
cleavage at the target RNA phosphoester. 

The dominant subclone was further analyzed under a variety of reaction 
conditions. Its self-cleavage activity was dependent on Pb'-* but was unaffected If 
Mg** was omitted from the reaction mixture. There was a requirement for a 
monovalent cation as well, which can be met by either Na* or K^ The reaction rate 
increased linearly with increasing concentration of monovalent cation over the range of 
0 - 1.0 M (r = 0.998). Other variables that may affect the reaction, such as pH, 
temperature, and the presence of other divalent metals, are in the process of being 
evaluated further. 
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Example 2 

Matariais and Methods 

A. Oligonucleotides and Olioonucleotide AnaioQs 

Synthetic DNAs and DNA analogs were purchased from Operon Technologies. 
5 The 19-nucleotide substrate. 5'-pTCACTATrAGGAAGAGATGG-3* (or 5'- 

pTCACTATNGGAAGAGATGG*3', wherein "N* represents adenosine ribonucleotide) 
<SEQ ID NO 7), was prepared by reverse-transcriptase catalyzed extension of 
5*-pTCACTATrA*3' {or 5'-pTCACTATN-3S wherein "N** represents adenosine 
ribonucleotide) (SEQ ID NO 8), as previously described (Breaker, Banerji, & Joyce, 
10 Biochemistry 33 : 11980-11986 (1994)), using the template 

5^CCATCTCTTCCTATAGTGAGTCCGGCTGCA-3' (SEQ ID NO 9). Primer 3, 5'- 
GGGACGAATTCTAATACGACTCACTATrA-3' (or 5*- 

GGGACGAATTCTAATACGACTCACTATN-3\ wherein "N" represents adenosine 
ribonucleotide) (SEQ ID NO 6), was either S'-labeled with |Y-^^P]ATP and T4 
1 5 polynucleotide kinase (primer 3a) or 5*-thiophosphorylated with [y-SlATP and T4 

polynucleotide kinase and subsequently bfottnylated with Miodoacetyl-ZV'- 
biotinylhexylenediamine (primer 3b). 

B. DNA Pool Preparation 

The starting pool of DNA was prepared by PCR using the synthetic oligomer 
20 5'-GTGCCAAGCTTACCG-N„-QTCGCCATCTCTTCC-3' (SEQ ID NO 4), where N is an 

equimolar mixture of A, T and C. A 2-ml PCR, containing 500 pmoles of the 
randomized oligomer, 1 ,000 pmoles primer 1 (5'-GTGCCAAGCTTACCG-3\ SEQ ID NO 
10), 500 pmoles primer 2 

(5'-CTGCAGAATTCTAATACGACTCACTATAGGAAGA6ATGGCGAC-3\ SEQ ID NO 11), 
25 500 pmoles primer 3b, 10 ^Ci (a-"PIdATP, and 0,2 U /il ' Tag DNA polymerase, was 

incubated in the presence of 50 mM KCl, 1.5 mM MgCI,, 10 mM Tris-HCi IpH 8.3 at 
23*'C), 0.01 % gelatin, and 0.2 mM of each dNTP for 1 min at 92*Ci 1 min at 50"C, and 
2 min at 72''C, then 5 cycles of 1 min at 92"C. 1 min at 50"C, and 1 min at 72'*C. The 
resulting mixture was extracted twice with phenol and once with chloroform / isoamy) 
30 alcohol, and the DNA was isolated by precipitation with ethanol. 

C. In Vitro Selection 

The starting pool of DNA was resuspended in 500 yX, of buffer A (1 M NaCt and 
50 mM HEPES (pH 7.0 at 23''C)} and was passed repeatedly over a streptavidin column 
(AfflnlTip Strep 20, Genosys, The Woodlands, TX). The column was washed with five 
35 100-/il volumes of buffer A, followed by five 100-^1 volumes of 0.2 N NaOH, then 

equMibrated with five 100-^1 volumes of buffer B (0.5 M NaCI, 0.5 M KC), 50 mM 
MgClj, and 50 mM HEPES (pH 7.0 at 23''C)). The immobilized single-stranded DNA was 
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flluted over the course 6i 1 hr with three 20-mI volumes of buffer B v^ith added 1 mM 
PbOAc. The entire immobilization and elution process was conducted at 23°C. The 
eluant was collected in an equal volume of buffer C (50 mM HEPES (pH 7.0 at 23X» 
and 80 mM EDTA) and the DNA was precipitated with ethanol. 

The resuhing DNA was amplified in a ^00nL PCR containing 20 pmoles primer 
1, 20 pmoles primer 2. 0.05 U Tai, polymerase, 50 mM KCI. 1.5 mM M9CI,. 10 mM 
Tris-HCI (pH 8.3 at 23*C), 0.01 % gelatin, and 0.2 mM of each dNTP for 30 cycles of 
10 sec at 92'C. 30 sec at 50*C. and 30 sec at 72'C. The reaction products were 
extracted twice with phenol and once with chloroform / isoamyl alcohol, and the DNA 
was recovered by precipitation with ethanol. Approximately 4 pmoles of the amplified 
DNA was added to a second, nested PCR containing 100 pmoles primer 1, 100 pmoles 
primer 3b, 20 ^Ci la-'^PldATP, and 0.1 U ,xV Tag polymerase, in a total vohjme of 200 
liL that was amplified for 10 cycles of 1 min at 92'C. 1 min at SO'C. and 1 min at 
72'C. The PCR products were once more extracted and precipitated, and the resulting 
DNA was rasuspended in 50 buffer A. then used to begin the next round of 
selection. 

The second and third rounds were carried out as above, except that the nested 
PCR at the end of the third round was performed in a 100-,.l volume. During the fourth 
round, the elution time following addition of Pb^* was reduced to 20 min (two 20-mL 
elution volumes) and only half of the recovered DNA was used In the first PCR, which 
involved only 1 5 temperature cycles. During the fifth round, the elution time was 
reduced to 1 min (two 20-;iL elution volumes) and only one-fourth of the recovered DNA 
was used In the first PCR. which invohred 1 5 temperature cycles. DNA obtained after 
the fifth round of selection was subcloned and sequenced, as described previously 
ITsang & Joyce, pinrh^mi^trv 33: 5966-5973 (1994)). 
D. r^m"^^ Analvfjlfi ftf raialvtic DNAs 

Populations of DNA and various subcloned Individuals were prepared with a 
5--«P label by asymmetric PCR in a 25-,xl reaction mixture containing 10 pmoles primer 
3a, 0.5 pmoles input DNA, and 0.1 U ^1' Tag polymerase, under conditions as described 
above, for 10 cycles of 1 min at 92'C. 1 min at 50»C. and 1 min at 72-C. The 
resuHing (5'-«PI-labelcd amplification products were purified by elecuophoresis in a 
10% polyacrylamide / 8 M gel. 

Self-cleavage assays were carried out following preincubation of the DNA in 
buffer B for 10 min. Reactions were initiated by addition of PbOAc to 1 mM final 
concenuation and were temiinated by addition of an equal volume of buffer C. Reaction 
products wore separated by electrophoresis in a 10% polyacrylamide / 8 M gel. K.net.c 
assays under multiple-turnover conditions were carried out in buffer B that included 50 
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ptjQ m\'^ BSA to prevent adherence of material to the vessel walls. Substrate and enzyme 
molecules were preincubated separately for 6 min In reaction buffer that lacked Pb^^, 
then combined, and the reaction was initiated by addition of PbOAc to a final 
concentration of 1 mM. 

Exampte 3 

Evolution Df DBnyvrtbozvmcs 
That Cleave Intermoleculafiy 

A. Conversion to an Intermolecular Format 

Based on the variable pattern of presumed base-pairing interactions between the 

catalytic and substrate domains of the studied variants, it was felt that it would be 

reasor>ably straightforward to convert the DNA-catalyzed reaction to an Intermolecular 

format. In doing so, we wished to simplify tha two substrate-binding regions of the 

catalyst so that each would form an uninterrupted stretch of 7-S base pairs with the 

substrate. In addition, we wished to provide a minimal substrate, limited to the two 

base-pairing regions end the intervening sequence 5*-GGA-3' (Fig. 4A). 

Figures 4A and 4B illustrate DNA-catalyzed cleavage of an RNA phosphoester in 

an intermolecular reaction that proceeds with catalytic turnover. Figure 4A is a 

diagrammatic representation of the complex formed between the ISmer substrate and 

38mer DNA enzyme. The substrate contains a single edertosine ribonucleotide CrA' or 

**N'', adjacent to the arrow), flanked by deoxyribonucteotldes. The synthetic DNA 

enzyme is a 38-nuc)eotide portion of the most frequently occurring variant shown in Fig. 

3. Highly-conserved nucleotides located within the putative catalytic domain are 

"boxed". As illustrated, one conserved sequence is "AGCG", while another is "CG" 

(reading in the 5*^3' direction). 

Figure 4B shows an Eadle-Hofstee plot used to determine K„ (negath/e slope) 

VfflM (y-intercept) for DNA-catalyzed cleavage of [5'-'^P)-labeled substrate under 

conditions identical to those employed during In vitro selection. Initial rates of cleavage 

were determined for reactions involving 5 nM DNA enzyme and either 0.125, 0.5, 1, 2, 

or 4 /iM substrate. 

In designing the catalytic domain, we relied heavily on the composition of the 
most reactive variant, truncating by two nucleotides at the 5* end end 1 1 nucleotides at 
the 3' end. The 15 nucleotides that lay between the two template regions were left 
unchanged and a single nucleotide was inserted into the 3* template region to form a 
continuous stretch of nucleotides capable of forming base pairs with the substrate. The 
substrate was amplified to the sequence S'- TCACTATrA • GG AAGAGATG G-3* (or 
5* -TCACTATN • GG AAGAGATGG -3'. wherein •N" represents adenosine ribonucleotide) 
CSEQ ID NO 12), where the underlined nucleotides correspond to the two regions 
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involved in base pairing with the catalytic DNA molecule. 

The simplified reaction system, employing a 38mer catalytic DNA molecule 
(catalyst) comprised entirely of deoxyribonucieotldes and a 19mer substrate containing a 
single ribonucleotide embedded within an otherwise all-DNA sequence, aUows efficient 
5 DNA-calalyzed phosphoester cleavage with rapid turnover. Over a 90-ininute incubation 

in the presence of 0.01 (M catalyst and 1 (iM substrate, 46% of the substrate is 
claaved, corresponding to 46 turnovers of the catalyst. A preliminary kinetic analysis of 
this reaction was carried out, evaluated under multiple-turnover conditions. The DNA 
catalyst exhibits Michaelis-Menten kinetics, with values for k„, and of 1 min ' and 2 

10 AxM, respectively (see Fig. 4B). The value for K„ Is considerably greater than the 

expected dissociation constant between catalyst and substrate based on Watson-Crick 
interactions. The substrate was incubated under Identical reaction conditions (but in the 
absence of the catalyst!; a value for k„^, of 4 » 10'^ min ' was obtained. This is 
consistent with the reported value of 5 x lO' min*^ for hydrolysis of the more labile 

1 5 1-nitrophBnyl-1,2-propanediol in the presence of 0.5 mM Pb^* at pH 7,0 and 37*C 

(Breslow & Huang, PMAS USA 88: 4080-4083 (1991)), 

It Is now presumed that the phosphoester cleavage reaction proceeds via a 
hydrolytic mechanism Involving attack by the ribonucleoside 2 -hydroxyl on the vicinal 
phosphate, generating a 5' product with a terminal 2*(3*)-cyclic phosphate and 3' 

20 product vwth a terminal B'-hydroxyl. In support of this mechanism, the 3*-cle8vage 

product Is efficiently phosphorylated with T4 polynucleotide kinase and [y-^'PlATP, 
consistent with the availability of a free 6*-hydroxyl (data not shown). 
B. Discussion 

After five rounds of h vitro selection, a population of single-stranded DNA 
25 molecules that catalyze efficient Pb^^-depondent cleavage of a target RNA phosphoester 

was obtained. Based on the common features of representative Individuals isolated 
from this population, a simplified version of both the catalytic and substrate domains 
was constructed, leading to a demonstration of rapid catalytic turnover in an 
intennolBCular context. Thus the 38mer catalytic domain provides en example of a DNA 
30 enzyme, or what might be tenmed a "deoxyrlbozyme". 

Referring to this molecule as an enzyme, based on the fact that it is an 
informational macromolecule capable of accelerating a chemical transformation in a 
reaction that proceeds with rapid turnover and obeys Michaelis-Menten kinetics, may 
not satisfy everyone's notion of what constitutes an enzyme. Some might insist that an 
35 enzyme, by definition, must be a polypeptide. If, however, one accepts the notion of an 

RNA enzyme, then it seems reasonable to adopt a similar view concerning DNA 
enzymes. Considering how quickly we were able to generate this molecule from a pool 
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of random-sequence DNAs, we expect that many other examples of synthetic DNA 
enzymes will appear in the near future. 

The Pb^*-dependent cleavage of an RNA phosphoester was chosen as an initial 
target for ONA catalysis because it is a straightforward reaction that simply requires the 
5 proper positioning of a coordinated Pb^'^-hydroxyl to facilitate deprotonation of the 2 ' 

hydroxyl that lies adjacent to the cleavage site. (See, e.g.. Pan, at al., in The RNA 
Wodd* Gesteland & Atkins (eds.), pp. 271-302, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY (19931.) Pb^*** is known to coordir^te to the N7 position of 
purines, the 06 position of guanine, the 04 position of uradi, and the N3 position of 

10 cytosine (Brown, et al.. Nature 303 : 543-546 (1993)). Thus, the differences In sugar 

composition and conformation of DNA compared to RNA seemed unlikely to prevent 
DNA from forming a weil-<Jefined Pb^^-binding pocket. 

A substrata that contains a single ribonucleotide within an otherwise all-DNA 
sequence was chosen because it provided a uniquely favored site for cleavage and 

1 5 insured that any resulting catalytic activity would be attributable solely to DNA. 

Substrate recognition appears to depend on two regions of base-pairing interactions 
between the catalyst and substrate. However, the unpaired substrate nucleotidesi 
5'*GGA-3*, that lie between these two regions may play an important role in substrate 
recognition, metal coordination, or other aspects of catalytic function. 

20 It is further anticipated that an all-RNA molecule, other RNA-DNA composites, 

and molecules containing one or more nucleotide analogs may be acceptable substrates. 
As disclosed herein, the within-described in vitro evolution procedures may successfully 
be used to generate enzymatic DNA molecules having the desired specificities; further 
analyses along these lines are presently underway. 

25 In addition, studies to determine whether the presumed base-pairing interactions 

between enzyme and substrate are generalizable with respect to sequence are in 
progress, using the presently-described methods. The within-disclosed Pb'^-dependent 
deoxyribozymes may also be considered model compounds for exploring the structural 
and enzymatic properties of DNA. 

30 The methods employed in the present disclosure for the rapid development of 

DNA catalysts will have considerable generality, allowing us to utilize other cofactors to 
trigger the cleavage of a target linkage attached to a potential catalytic domain. In this 
regard, the development of Mg^^-dependent DNA enzymes that specifically cleave 
target RNAs under physiological conditions is of imeresti as is the development of DNA 

35 enzymes that function in the presence of other cations (see Example 4). Such 

molecules will provide an alternative to traditional antisense and ribozyme epproaches 
for the specific inactivation of target mRNAs. 
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DNA thus joins RNA and protein on the list of biological macromolecules that ere 
capable of exhibiting enzymatic activity. The full extent of DNA's catalytic abilities 
remains to be explored, but these explorations should proceed rapidly based on !n vitro 
selection methods such as those employed in this study. 
5 ONA enzymes offer several important advantages compared to other 

macromolecular catalysts. First, they are easy to prepare, in an era when most 
laboratories have access to an automated DNA synthesizer and the cost of DNA 
phosphoramidites has become quite modest. Second, they are very stable compounds, 
especially compared to RNA, thus facilitating their use in biophysical studies. Third, we 

10 expect that they can be adapted to therapeutic applications that at present make use of 

antlsense DMAs that lack RNA-cleavsge ectivlty. In vitro selection could be carried out 
with DNA analogs, including compounds that are nuclease resistant such as 
phosphorothloate-containing DNA, so long as these analogs can be prepared in the form 
of a deoxynucleoside S'-triphosphate and are accepted as a substrate by a 

1 5 DNA-dependent DNA polymerase. Finally, DNA enzymes offer a new window on our 

understanding of the macromolecular basis of catalytic function. It will be interesting, 
for example, to carry out comparative analyses of protein-, RNA-, and DNA-based 
enzymes that catalyze the same chemical transformation. 

Example 4 

20 Qthr^f Families of Catfllvtlc DNAs 

A starting pool of DNA was prepared by PGR essentially as described In Example 
2.B. above, except that the starting pool of DNA comprised molecules containing 40 
random nucleotides. Thus, the starting pool of DNA described herein was prepared by 
PGR using the synthetic oligomer 5 ' GGG ACG AAT TCT AAT ACG ACT CAC TAT rA 

25 GG AAG AGA TGG CGA CAT CTC N40GT GAC GGT AAG CTT GGC AC 3 ' (SEQ ID NO 

23), where N is an equlmolar mixture of G, A, T end C, and where the DNA molecules 
were selected for the ability to cleave the phosphoester following the target rA. (See 
Figure 6A, also.) 

Selective amplification was carried out In the presence of cither Pb^%Zn*\Mn*% 
30 or Mg*\ thereby genereting at least four -families- of catalytic DNA molecules. As 

illustrated in Figure 5, catalytic DNA molecules demonstreting specific activity were 
generated in the presence of a variety of cations. 

Figure 5 is a photographic representation showing a polyacrylamide gel 
demonstrating specific endoribonuclease activity of four families of selected catalytic 
35 DNAs. Selection of a Pb^* -dependent family of molecules was repeated in a side-by- 

side fashion as a control. In each group of three lanes* the first lane shows the lack of 
activity of the selected population in the absence of the metal cation, the second lane 
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show$ the observed activity in the presence of the metal cation, and the third lane 
shows the lack of activfty of the starting pool IGO). At present, the order of reactivity is 
observed to be Pb^*>Zn^'^ >Mn'* >Mg**, mirroring the pK, of the corresponding metal- 
hydroxide. 

5 After either five (G5) or six (G6) rounds of selective amplification in the presence 

of the preselected divalent cation, the desired endonuclease activity was obtained. The 
following description of selective amplification in the presence of Mg'^ Is intended to be 
exemplary. 

Six rounds of in vitro selective amplification were carried out, following the 
1 0 method described in Example 2 hereinabove, except that the divalent metal used was 1 

mM Mg^'^ rather than 1 mM Pb^^. (See also Breaker and Joyce, Chem. & BioL 1 i 
223-229 (1994), incorporated by reference herein, which describes essentially the same 
procedure.) 

Individual clones were isolated following the sixth round, and the nucleotide 
15 sequence of 24 of these clones was determined. All of the sequences began with: 5 ' 

G6G ACG AAT TCT AAT ACG ACT CAC TAT rA GG AAG AGA TGG CGA CA ISEQ ID 
NO 23 from position 1 to 44) and ended with; CGG TAA CCT TGG CAC 3 ' (SEQ ID 
NO 23 from position 93 to 107). 

The segment in the middle, corresponding to TCTC GTGA (SEQ ID NO 23 
20 from position 45 to 92) in the starting pool, varied as follows: 

II 3) CCG CCC ACC TCT TTT ACG AGC CTG TAC GAA ATA GTG CTC TTG 

TTA GTA T (SEQ ID NO 24) 
(5) TCT err CAG CGA TGC ACG CTT GTT TTA ATG TTG CAC CCA TGT 
25 lAG TGA (SEQ ID NO 25} 

(2) TCT CAT CAG CGA TTG AAC CAC TTG GTG GAC AGA CCC ATG TTA 

GTG A (SEQ ID NO 26) 
(1 ) CCG CCC ACC TCT TTT ACG AGC CTG TAC GAA ATA GTG TTC TTG 
TTA GTA T (SEQ ID NO 27) 
30 (1 ) CCG CCC ACC TCT TTT ACQ AGC CTG TAC GAA ATA GTG CTC TCG 

TTA GTA T (SEQ ID NO 28) 
(1) TCT CAG ACT TAG TCC ATC ACA CTC TGT GCA TAT GCC TGC TTG 

ATG TGA (SEQ ID NO 29) 
(1 1 -CT CTC ATC TGC TAG CAC GCT CGA ATA GTG TCA GTC GAT GTG A 
35 (SEQ ID NO 30). 
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The Initial number In parentheses indicates the number of clones having that 
particular sequence. Note that some mutations (highlighted in bold type) occurred at 
nucleotide positions other than those that were randomized Initially. 

The second sequence listed above (i.e., SEQ ID NO 25), which occurred in 5 of 
5 24 clones, was chosen as a lead (i.e. principal) compound for further study. Its 

cleavage activity was measured in the presence of a 1 mM concentration of various 
divalent metals and 1 M NaCI at pH 7.0 and 23*0: 



metal Kt» ImJ^'^) 

10 none n.d. 

Mg^+ 2.3 X 10^ 

Mn^* 6.8x10-' 
Zn'* 4.2 X 10'^ 

Pb'* 1.1 X 10-' 

15 

Thus, the lead compound is active in the presence of ail four divalent metals, 
even though it was selected for activity in the presence of Mg*\ Conversely, DNA 
molecules that were selected for activity in the presence of Mn*\ Zn»% or Pb^* did not 
show any activity in the presence of Mg^*. 

20 In addition, the population of DNAs obtained after six rounds of in vitro selection 

in the presence of Mg'*, when prepared as all-phosphorothioate-containing DNA 
analogs, showed Mg'* -dependent cleavage activity at an observed rate of -lO^ min \ 
The phosphorothioate-contolning analogs were prepared enzymatically so as to have an 
/?p configuration at each stereocenter. Such compounds are relatively resistant to 

25 degradation by cellular nucleases compared to unmodified DNA. 

The lead compound was re-randomlzed at 40 nucleotide positions (underlined), 
introducing mutations at a frequency of 15% <5% probability of each of the three 
possible base substitutions). The re-randomlzed population was subjected to seven 
additional rounds of in vitro selection. During the last four rounds, molecules that were 

30 reactive in the presence of 1 mM Pb^* were removed from the population before the 

remainder were challenged to react in the presence of 1 mM Mg*\ Individual clones 
were isolated following the seventh round and the nucleotide sequence of 14 of these 
clones was determined. All of the sequences began with: 5' GGG ACG AAT TCT AAT 
ACQ ACT CAC TAT rA 66 AAG A6A T6G CGA CAT CTC (SEQ ID NO 23, from position 

35 1 to 48), and ended with: GT6 ACG 6TA AGC TTG GCA C 3 ' (SEQ ID NO 23, from 

position 89 to 107). 

The segment in the middle, corresponding to the 40 partially-randomized 
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positions SEQ 10 NO 23, from position 49 to 88), varied as follows: 



10 



15 



2C 



(4> TAC AGC GAT TCA CCC TTG TTT AAG 6GT TAC ACC CAT GTT A 
(SEQ ID NO 31) 

(2) ATC AGC GAT TAA CGC TTG TTT CAA TGT TAC ACC CAT GTT A 
(SEQ ID NO 32) 

(2) TTC AGC GAT TAA CGC TTA TTT TAG CGT TAC ACC CAT GTT A 

(SEQ ID NO 33) 

( 1 ) ATC AGC GAT TCA CCC TTG TTT TAA GGT TGC ACC CAT GTT A 
(SEQ ID NO 34) 

(1) ATC AGC GAT TCA CCC TTG TTT AAG CGT TAC ACC CAT GTT G 
(SEQ ID NO 35) 

( 1 ) ATC AGC GAT TCA CCC TTG TTT TAA GGT TAC ACC CAT GTT A 
(SEQ ID NO 36) 

(1 ) ATC AGC GAT TAA CGC TTA TTT TAG CGT TAC ACC CAT GTT A 

(SEQ ID NO 37) 

(1) ATC AGC GAT TAA CGC TTG TTT TAG TGT TGC ACC CAT GTT A 
(SEQ ID NO 38) 

( 1 ) ATC AGC GAT TAA CGC TTA TTT TAG CAT TAC ACC CAT GTT A 
(SEQ ID NO 39). 



The number in parentheses indicates the number of clones having that particular 
sequence. Nucleotides shown in bold are those that differ compared to the lead 
compound. 

25 Formal analysis of the cleavage activity of these clones Is ongoing. The 

population as a whole exhibits Mg^^ -dependent cleavage activity at an observed rate of 
- 10^' min'\ with a comparable level of activity in the presence of Pb'^. 

Figures 6A end 6B provide two-dimensional illustrations of a "progenitor" 
catalytic DNA molecule and one of several catalytic DNA molecules obtained via the 

30 selective amplification methods disclosed herein, respectively. Rgure 6A illustrates an 

exemplary molecule from the starting pooh showing the overall configuration of the 
molecules represented by SEQ ID NO 23. As illustrated, various complementary 
nucleotides flank the random (N40) region. 

Figure 6B is a diagrammatic representation of one of the Mg'^-dependent 

35 catalytic DNA molecules (or "DNAzymes") generated via the within-described 

procedures. The location of the ribonucleotide In the substrate nucleic acid is indicated 
via the arrow. (The illustrated molecule includes the sequence identified herein as SEQ 



wo 96/17086 



PCT/US95/15580 



-47- 

ID NO 25. as well as "beginning" and "ending" sequences of SEQ ID NO 23.) 

Endonuclease activity ts continuing to be enhanced in each of the 
aforementioned **families" via in vitro evolution, as disclosed herein, so it is anticipated 
that enzymatic DNA molecules of increasingly desirable speeifrcities may be generated 
5 successfully using the within-disclosed guidelines. 

Example 5 
Cleavage of \ arnBf RNA Seouences 
As an extension of the foregoing, we have developed DNA enzymes that cleave 
an all-RNA substrate, rather than a single ribonucleotide embedded within an otherwise 
10 all-DNA substrate as demonstrated above. (Also see R.R. Brealcer & 6.F. Joyce* Chcmi 

& Biol. 1 : 223-229 (1994); R.R. Breaker & G.F, Joyce, Ghftm. ft BiQl, I' 655-660 
(19951). As a target sequence, we chose a stretch of 12 highly-conserved nucleotides 
within the U5 LTR region of HIV-1 RNA, having the sequence 
5' GUAACUAGAGAU 3' (SEQ ID NO 49). 
1 5 Following the methods described In the previous examples, we generated a pool 

of 1014 DNA molecules that have the following composition: 

5 - GQAAAA r(GUAACUAGAGAU) GG A AG AG ATGGCG AC Ngo 
CGGTAAGCTTGGCAC -3' (SEQ ID NO 50), 
where H is an equimolar mixture of the deoxyribonucleotides G, A, T, and C, and where 
20 the sequence identified as 'r (GUAACUAGAGAU)' is comprised of r/tonucleotides. 

(Optionally, one may alter the initial 5' nucleotide sequence, e.g., by adding an 
additional dA residue to the sequence preceding the ribonucleotide portion at the 5* end, 
thus causing the initial sequence to read "GGAAAAA" and causing SEC ID NO 50 to be 
99 residues in length. Clearly, this is but one example of the modifications that may be 
25 made in order to engineer specific enzymatic DNA molecules, as disclosed in detail 

herein.) 

The enzymatic DNA molecules thus produced were selected for their ability to 
cleave a phosphoester that lies within the embedded RNA target sequence. Ten rounds 
of in vitro selective amplification were carried out. based on the enzymatic DNA 

30 molecules' activity in the presence of 10 mM Mg*"" at pH 7.5 and 37*C. During the 

selection process, there was competition for 'preferred" cleavage sites as well as for the 
-best" catalyst that cleaves at each such preferred site. Two sites and two families of 
catalysts emerged as possessing the most efficient cleavage capabilities Isee Fig. 7). 
Figure 7 illustrates some of the results of ten rounds of in vitro selective 

35 amplification carried out essentially as described herein. As shown, two sites and two 

families of catalysts emerged as displaying the most efficient cleavage of the target 
sequence. Cleavage conditions were essentially as indicated in Fig. 7, namely, lOmM 
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hAg^*, pH 7.5, and 37*; data collected after the reaction ran for 2 hours is shown* 
Cleavage (%) is shown plotted against the number of generations (here, 0 through 10). 
The nurDber/prevalence of catalytic ONA molecules capable of cleaving the target 
sequence at the indicated sites in the substrate is illustrated via the vertical bars, with 
5 cleavage at GIUAACUAGAGAU shown by the striped bars, and with cleavage at 

GUAACUAJGAGAU Hlustrated via the open (lightly-shaded) bars. In Figure 7, as herein, 
the arrow (I) indicates the site between two neighboring nucleotides at which cleavage 
occurs. 

Various Individuals from the population obtained after the 8th and 10th rounds 

10 of selective amplification were cloned. The nucleotide sequences of 29 individuals from 

the Bth round and 32 individuals from the 10th round were then determined (see Tables 
2 end 3, respectively). 

Under the heading "Nucleotide Sequence' in each of Tables 2 and 3 Is shown 
the portion of each Identified clone that corresponds to the 50 nucleotides that were 

15 randornized in the starting pool (i.e.i Ns^); thus, the entire nucleotide sequence of a 

given clone generally includes the nucleotide sequences preceding, following, and 
including the "N^q" segment, presuming the substrate sequence is attached ar>d that 
self-cleavage has not occurred. For example, the entire sequence of a (non-self-cieaved) 
clone may generally comprise residue nos. 1-33 of SEQ ID NO 50, followed by the 

20 residues representing the randomized N^o region, followed by residue nos. 84-98 of SEQ 

ID NO 50, or by residue nos. 1-34 of SEQ ID NO 51, followed by the residues 
representing the randomized N^o region, followed by residue nos. 85*99 of SEQ ID NO 
51 . It is believed, however, that the N50 (or N40) region or a portion thereof - of each 
clone is particularly important in determining the specificity and/or activity of a particular 

25 enzymatic DNA molecule. This Is particularly evident in reactions in which the substrate 

and the DNAzyme are separate molecules (see, e.g.. Figs. 8 and 9). 

Clone numbers are designated as 8-x or 10-x for individuals obtained after the 
Bth or 10th rounds, respectively. SEQ ID NOS are also listed and correspond to the 
"Nso" region of each clone. 
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Table 2 

Cloned Individuals from 8th Round of AmplW Ication 



Clone SEQ 

No. tP NO "N g^ " Nuclp ntide Sequence f5'-3') 



8-2 52 CCA ATA GTG CTA CTG TGT ATC TCA ATG CTG GAA ACA CGG GTT 
ATC TCC CG 

8-4 53 CCA AAA CAG TG6 AGC ATT ATA TCT ACT CCA CAA AGA CCA CTT 
TTC TCC CG 

1 0 8-5' 54 ATC CGT ACT AGC ATG CAG ACA GTC TGT CTG CTT TTT CAT TAC 

TCA CTC CC 

8-14 55 CAA TTC ATG ATG ACC AAC TCT GTC AAC ACG CGA ACT TTT AAC 
ACT GGC A 

8-17* 56 CTT CCA CCT TCC GAG CCG GAC GAA GTT ACT TTT TAT CAC ACT 
15 ACQ TAT TG 

8-3 57 GGC AAG AGA TGG CAT ATA TTC AG6 TAA CTG TGG AGA TAC CCT 
GTC TGC CA 

8-6 58 CTA GAC CAT TCA CGT TTA CCA AGC TAT GGT AAG AAC TAG AAT 
CAC GCG TA 

20 8-8 59 CGT ACA CGT GGA AAA GCT ATA AGT CAA GTT CTC ATC ATG TAC 

CTG ACC GC 

8-1 0 60 CAG TGA TAC ATG AGT GCA CCG CTA CGA CTA AGT CTG TAA CTT 
ATT CTA CC 

8-22 61 ACC GAA TTA AAC TAC CGA ATA GTG TGG TTT CTA TGC TTC TTC 
25 TTC CCT GA 

8-1 1 62 CAG GTA GAT ATA ATG CGT CAC CGT GCT TAC ACT CGT TTT ATT 
AGT ATG TC 

8-21 63 CCC TAC AAC ACC ACT GGG CCC AAT TAG ATT AAC GCT ATT TTA 
TAA CTC G 

30 8-12 64 CCA AAC GGT TAT AAG ACT GAA AAC TCA ATC AAT AGC CCA ATC 

CTC GCC C 

8-1 3 65 CAC ATG TAT ACC TAA GAA ATT GGT CCC GTA GAC GTC ACA GAC 
TTA CGC CA 

8-23 66 CAC AAC GAA AAC AAT CTT CCT TGG CAT ACT GGG GAG AAA GTC 
35 TGT TGT CC 

8-40 67 CAC ACG AAC ATG TCC ATT AAA TGG CAT TCC GTT TTT CGT TCT 
ACA TAT GC 
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8-24 68 CAG AAC GAG GGT CTT GTA AGA CTA CAC CTC CTC AGT GAC AAT 

AAT CCT G 

8-26 69 CAC TAC AGC CTG ATA TAT ATG AAG AAC A6G CAA CAA GCT TAT 
GCA CTG G 

5 8-27 70 GGG TAC ATT TAT GAT TCT CTT ATA AAG AGA ATA TCG TAC TCT 

TTT CCC CA 

8-28 71 CCA AAG TAC ATT CCA ACC CCT TAT ACG TGA AAC TTC CAG TAG 
TTT CCT A 

8-28 72 CTT GAA GAT CCT CAT AAG ACG ATT AAA CAA TCC ACT GGA TAT 
10 AATCCGGA 

8-34 73 CGA ATA GTG TCC ATG ATT ACA CCA ATA ACT GCC TGC CTA TCA 
TGTTTATG 

B-35 74 CCA AGA GAG TAT CGG ATA CAC TTG GAA CAT AGC TAA CTC GAA 
CTG TAC CA 

1 5 8-36 75 CCA CTG ATA AAT AGG TAA CTG TCT CAT ATC TGC CAA TCA TAT 

GCC GTA 

8-37 76 CCC AAA TTA TAA ACA ATT TAA CAC AAG CAA AAG GAG GTT CAT 
TGC TCC GC 

8-39 77 CAA TAA ACT GGT GCT AAA CCT AAT ACC TTG TAT CCA AGT TAT 
20 CCT CCC CC 

^ Identical to 10-4, 10-40 

^ identical to 8-20, 8-32, 8-38, 10-1, 10-34; 1 mutation to 10-1 1; 3 mutations 
2S to 10-29 

Tables 

Cloned Individuals from 1 0th Round of Amplification 

30 Clone SEQ 

No- IP NO "N,^" Muclenlide SRniiRnce rS'^S'l 

1 0-3* 78 CCQ AAT GAC ATC CGT AGT GGA ACC TTG CTT TTG ACA CTA AGA 
AGC TAC AC 

10-10 79 CCA TAA CAA ATA CCA TAG TAA AGA TCT GCA TTA TAT TAT ATC 
36 GGT CCA CC 

10-12 80 CAG AAC AAA GAT CAG TAG CTA AAC ATA TGG TAC AAA CAT ACC 
ATC TCG CA 

10-14 81 CCT TTA GTT AGG CTA GCT ACA ACG ATT TTT CCC TGC TTG GCA 
ACG ACA C 
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10-1 5 82 CTC CCT ACG TTA CAC CAG CGG TAG GAA TTT TCC ACQ AGA GGT 
AAT CCQ CA 

10-19 83 CGG CAC CTC TAG TTA GAC ACT CCG GAA TTT TTC CCC 
10-39 84 CGG CAC CTC TAG TTA GAC ACT CCG GAA TTT TAG CCT ACC ATA 
GTC CGG T 

1 0-23 85 CCC TTT GGT TAG GCT AGC TAC AAC GAT TTT TCC CTG CTT GAA 
TTGTA 

1 0-27* 86 CCC TTT GGT TAG GCT AGC TAC AAC GAT TTT TCC CTG CTT GAC 

CTG TTA CGA 

10-31 87 CCT TTA QTT AGG CTA GCT ACA ACG ATT TTT CCC TGC TTG GAA 
CGA CAC 

10- 1 8 88 CAT 6GC TTA ATC ATC CTC AAT AGA AGA CTA C AA GTC GAA TAT 
GTC CCC CC 

1 0-20 89 CAA CAG AGC GAG TAT CAC CCC CTG TCA ATA GTC QTA T6A AAC 
ATT GGG CC 

1 0-6 90 TAC CGA CAA GGG GAA TTA AAA GCT AGC T6G TTA TGC AAC CCT 
TTT CGC A 

10-7 91 CTC GAA ACA GTG ATA TTC TGA ACA AAC GGG TAC TAC GTG TTC 
AGC CCC C 

1 0-8 92 CCA ATA ACG TAA CCC GGT TAG ATA AGC ACT TAG CTA AGA TGT 

TTA TCC T6 

10-16 93 CAA TAC AAT CGG TAC GAA TCC AGA AAC ATA ACG TTG TTT CAG 
AAT GGT CC 

10-21 94 GCA ACA ACA AGA ACC AAQ TTA CAT ACA COT TCA TOT ATA CTG 
AAC CCC CA 

1 0 24 9S CCT TTG AGT TCC TAA ATG CCG CAC GGT AAG CTT QQC ACA CTT 
TGA CTG TA 

1 0-28 96 CAA AGA TCT CAC TTT GGA AAT 6CG AAA TAT GTA TAT TCQ CCC 

TGT CTG C 

1 0-33 97 CCA CGT AGA ATT ATC TGA TTT ATA ACA TAA CGC AGG ATA ACT 
CTC GCC CA 

1 0-35 98 CAC AAG AAA GTG TC6 TCT CCA GAT ATT TGA GTA CAA GGA ACT 
ACG CCC 

1 0-36 99 CAT GAA GAA ATA GGA CAT TCT ACA QGC TGG ACC GTT ACT ATG 

CCT GTA GG 

10-37 100 CAT AGG ATA ATC ATG GCG ATG CTT ATG ACG TGT ACA TCT ATA 
CCTT 
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10-38 101 CAG ATG ATC TTC CTT TAA AGA CTA CCC TTT AAA GAA ACA TAA 
GGT ACC CC 

' 1 mutation to 10-5 
^ 1 mutation to 10-30 

The self-deavage activity of various clones v^as subsequentty measured. Clones 
8-5. 8-17, and 10-3 were found to cleave etflciontiy at the site 5* GUAACUIAGAGAU 
3', while clones 10-14, 10-19 and 10-27 were found to cleave efficiently at the site 5' 
GiUAACUAGAGAU 3\ When the RNA portion of the molecule was extended to the 
sequence 5' GGAAAAAGUAACUAGAGAUGGAAG 3' (residue nos. 1-24 of SEQ ID NO 
51), clones B-17, 10-14, and 10-27 retained full activity, while clones B-5, 10-3, and 
10-19 showed diminished activity. Subsequently, done 10-23 was found to exhibit a 
high level of activity in the self-cleavage reaction involving the extended RNA domain. 

It should also be noted, in the event one of skill in the relevant art does not 
appreciate same, that the nucleotide sequences preceding and following the "Ngg" 
segments of the polynucleotide molecules engineered according to the teachings of the 
present invention disclosure may be altered in a variety of ways In order to generate 
enzymatic DNA molecules of particular specificities. For example, while residue nos. 1- 
24 of SEQ ID NO 51 are described herein es RNA nucleotides, they may alternatively 
comprise DNA, RNA, or composites thereof. (Thus, for example, SEQ ID NO 51 could 
easily be altered so that nucleic acid residue ros. 1-7 would comprise DNA, residue nos. 
8-19 would comprise RNA, residue nos. 20-99 would comprise DNA, and so on.) 
Similarly, the nudeotfdes following the "'Nbo" region may comprise RNA, DNA, or 
composites thereof. The length of the regions preceding and following the "Nsq" (or 
'N40" see Example 4) region(s) may also be varied, as disdosed herein. Further, 
sequences preceding and/or following Ngo or N40 regions may be shortened, expanded, 
or deleted in their entirety. 

Moreover, as noted above, we selected a specific region of HIV-1 RNA as the 
target sequence in the methods described in this Example; such a sequence is not the 
only sequence one may use as a target. Clearly, one of skill in the relevant art may 
follow our teachings herein to engineer and design enzymatic DNA molecules with 
spedfidty for other target sequences. As disclosed herein, such target sequences may 
be constructed or Inserted Into larger sequences comprising DNA, RNA, or composites 
thereof, as illustrated by SEQ ID NOS 50 and 51. 

The self-cleavage reaction was easily converted to an intermolecular cleavage 
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reaction by dividing the enzyme and substrate domains into separate nr>olecules. Clones 
8-1 7 and 10-23 were chosen as prototype molecules. Both were shown to act as DNA 
enzymes in the cleavage of a separate all-RNA substrate in a reaction that proceeds with 
multiple turnover (Fig, 8). The substrate binding arms were subsequently reduced to 7 
5 base-pairs on each side of the unpaired nucleotide that demarcates the cleavage site 

(Rg. 9). 

Figure 8 Illustrates the nucleotide sequences, cleavage sites, and turnover rates 
of two catalytic DNA molecules of the present invention, clones 8-17 and 10-23. 
Reaction conditions were as shown, namely, 10mM Mg'*, pH 7.5. and 37'C. The 

10 DNAzyme identified as clone 8-17 Is illustrated on the left, with the site of cleavage of 

the RNA substrate indicated by the arrow. The substrate sequence 15' - 
GGAAAAA6UAACUAGAGAUGGAAG • 3') - which is separate from the DNAzyme (i.e., 
intermolecular cleavage is shown) - is labeled as such. Similarly, the DNAzyme 
Identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 

15 substrate indicated by the arrow. Again, the substrate sequence Is indicated. For the 8- 

17 enzyme, the turnover rate was approximately 0.6 hr»; for the 10-23 enzyme, the 
turnover rate was approximately 1 hr\ 

As illustrated in Fig, 8, the nucleotide sequence of the clone 8-17 catalytic DNA 
molecule capable of cleaving a separate subatrate molecule was as follows: 

20 5'-CTTCCACCTTCCGAGCCGGACGAAGTTACTTTTT-3* (residue nos. 1-34 of SEQ ID 

NO 56). In that same figure, the nucleotide sequence of the clone 10-23 catalytic DNA 
molecule capable of cleaving a separate substrate molecule was as follows: 
5'-CTTTGGTTAGGCTAGCTACAACGATTTTTCC-3* (residue nos. 3-33 of SEQ ID NO 

65). 

25 Figure 9 further illustrates the nucleotide sequences, cleavage sites, and 

turnover rates of two catalytic DNA molecules of the present invention, clones 8-17 and 
10-23. Reaction conditions were as shown, namely, lOmM Mg^% pH 7.5, and 37*C. 
As in Fig. 8, the DNAzyme Identified as clone 8-17 is Illustrated on the left, with the site 
of cleavage of the RNA substrate indicated by the arrow. The substrate sequence (5' - 

30 GGAAAAAGUAACUAGAGAUGGAAG - 3*) -which is separate from the DNAzyme (i.e., 

intermolecular cleavage is shown) - is labeled as such. Similarly, the DNAzyme 
identified herein as 10-23 is shown on the right, with the site of cleavage of the RNA 
substrate indicated by the arrow. Again, the substrate sequence is indicated. For the 8- 
17 enzyme, k^^ was approximately 0.002 min '; for the 10-23 enzyme, the value of 

35 was epproximately 0.01 min '. 

As Illustrated in Fig. 9, the nucleotide sequence of the clone 8-17 catalytic DNA 
molecule capable of cleaving a separate substrate molecule was as follows: 



wo 96/17096 



PCTA}S95^15580 



-54- 

5'-CCACCTTCCGAGCCGGACGAAGTTACT-3* (residue nos. 4-30 of SEQ ID NO 56). In 
that same figure, the nucleotide sequence of the clone 10-23 catalytic DNA molecule 
capable of cleaving a separate substrate molecule was as follows: 
5*-CTAGTTAGGCTAGCTACAACGATTTTTCC-3' (residue nos. 5-33 of SEQ ID NO 85, 
5 with XTA* substituted for "TTG" at the 5' end). 

The catalytic rate of the RNA-cleaving DNA enzymes has yet to be fully 
optimized. As disclosed above and as reported in previous studies, we have been able 
to improve the catalytic rate by partially randomizing the prototype molecule and 
carrying out additional rounds of selective amplification. We have found, however, that 
10 the for Mg'^ is approximately 5 mM and 2 mM for the 8-17 and 10-23 DNA 

enzymes, respectively, measured at pH 7.5 and 37*C; this is certainly compatible with 
intracellular conditions. 



The foregoing specification, including the specific embodiments and examples, Is 
1 5 Intended to be Illustrative of the present invention and is not to be taken as limiting. 

Numerous other variations and modifications can be effected without departing from the 
true spirit and scope of the present invention. 
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(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: United States 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Pa tent In Release fH.O, Version #1-25 

(vi) CURRENT APPLICATION DATA: 

{A) APPLICATION NUMBER: PCT/US95/ 

(B) FILING DATE: Ol-DEC-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/472,194 

(B) FILING DATE: 07-JUN-1995 

(vii) PRIOR APPLICATION DATA: 

<A) APPLICATION NUMBER: US 08/349,023 
(B) FILING DATE: 02 -DEC- 19 9 4 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Logan, April c. 

(B) REGISTRATION NUMBER: 33,950 

(C) REFERENCE/DOCKET NUMBER: 463.2 PC 
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(ix) TELECOMMUNICATION INFORMATION; 

(A) TELEPHONE: (619) 554-2937 

(B) TELEFAXi (€19) 554-6312 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
C6GTAAGCTT GGCAC 15 
20 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQOTNCB CHARACTERISTICS; 

<A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
25 (C) 8TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (ix) FEATURE: 

(A) NAME/KEY: misc_dif f erence 

(B) LOCATION: replace (B, »•*) 

(D) OTHER INFORMATION: /standard.name^ "ADENOSINE 
RIBONUCLEOTIDE" 

35 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCACTATNAG 6AAGAGATGO 
5 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 

U±) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACACATCTCT 6AAGTAGCGC CGCCGTATAG TGACGCTA 
20 (2) INFORMATION FOR SEQ ID NO; 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:*: 

GTGCCAAGCT TACCGNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNMNNNH NNNNNNNNNN 
NNNNNGTCGC CATCTCTTCC 

35 



PCTAJS9S/1S580 



-58- 

(2) INFOKMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_£eature 

(B) LOCATION: 2 8 

(D) OTHER INFORMATION: /stanclard_naine= "2 '3' CYCLIC 
PHOSPHATE" 

(ix) FEATURE: 

(A) KAME/KEY: misc_dif f erence 

(B) LOCATION: replace(28. 

(D) OTHER INFORMATION: /standard_name« "ADENOSINE 
RIBONUCLEOTIDE " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
G66ACGAATT CTAATAC6AC TCACTATN 28 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA (genomic) 
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26 



iix) FEATURE: 

(A) NAME/KEY: misc^diff erence 

(B) LOCATION: replace (28, 
5 (D) OTHER INFORMATION: /standard_naine= "ADENOSINE 

RIBONUCLEOTIDE" 

(Xt) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
10 GGGACGAATT CTAATACGAC TCACTATO 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

15. (A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_dif f erence 

(B) LOCATION: replace (8, "») 

25 (D) OTHER INFORMATION: /standard_nan»e= "ADENOSINE 

RIBONUCLEOTIDB " 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

19 

30 TCACTATNGG AAGAGATOG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 8 base pairs 

(B> TYPE: nucleic acid 
(C> STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: misc__dif ference 

(B) LOCATION: replace (8, 

(D) OTHER INFORMATION: /standard_naineB "ADENOSINE 
NUCLEOTIDE" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:B: 

TCACTATN 8 

(2) INFORMATION FOR SEQ ID NO: 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCATCTCTTC CTATAGTGAG TC0GGCT6CA 30 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTGCCAAGCT TACCG 15 
5 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 11: 

CTGCAGAATT CTAATACGAC TCACTATAG6 AAGAGATGGC GAC 43 
(2) INFORMATION FOR SEQ ID NO: 12: 

20 

[i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 
30 (A) NAME/KEY: roisc_dif f erence 

(B) LOCATION: replace (8. "•) 

(D) OTHER INFORMATION: /standard_name« •ADENOSINE 

RIBONUCLEOTIDE** 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TCACTATNOQ AA6AGATGG 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc^dif ference 

(B) LOCATION: replace(28, 

(D) OTHER INFORMATION: /standard^name- "ADENOSINE 
RIBONUCLEOTIDE " 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

GGGAC6AATT CTAATACGAC TCACTATNGG AAGAGATGGC GAC 43 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCACACATCT CTGAAGTAGC 6CCGCCGTAT GTGACGCTA6 GGGTTOGCCT 50 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

10 GGGGGGAACG CCGTAACAA6 CTCTGAACTA GCGGTTGCGA TATAGTCGTA 50 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
CGGGACTCCG TAGCCCATTG CTTTTTGCAG CGTCAACGAA TAGCGTATTA 50 
(2) INFORMATION FOR SEQ ID NO:17: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIX)GY: linear 

(ii) MOLECULE TYPEi DNA (genomic) 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCACCATGTC TTCTCGAGCC GAACCGATAG TTACGTCATA CCTCCCGTAT 50 
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(2) INFORMATION FOR SEQ ID H0:18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
<B) TYPE: miclelc acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 
GCCAGATTGC TGCTACCAGC GGTAC©AAT AGTGAAGTGT TCGTCACTAT 50 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATAGGCCATG CTTTGGCTAG CGGCACCGTA TA6TGTACCT GCCCTTATCG 50 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



wo 96/17086 



PCrAIS9Sa5S80 



-65- 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO:20: 
TCTGCTCTCC TCTATTCTAG CAGTGCAGCG AAATATGTCG AATAGTCGGT 50 
5 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE J nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (xl) SEQUENCE DESCRIPTION: SEQ ID NO:21> 

TTGCCCAGCA TA6TCGGCAG ACGTGGTGTT AGCGACACGA TAGQCCCGGT 50 
(2) INFORMATION FOR SEQ ID NO: 22: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

30 

TTGCTAGCTC GGCTGAACTT CTGTAGCGCA ACCGAAATAG TGAGGCTT6A 50 
(2) INFORMATION FOR SEQ ID NO:23: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 
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<C) STRANDEDNBSS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genoroic) 

(ix) FEATURE: 

(A) NAME/KEY; misc^dif f erence 

(B) LOCATION: replace (28, "•') 

(D) OTHER INFORMATION: /standard_naine- "ADENOSINE 
RIBONUCLEOTIDE" 
/label- rA 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO:23 : 

GGGACGAATT CTAATACGAC TCACTATNGG AAGAGATGGC GACATCTCNN NNNNNNNNNN €0 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNGT GACGGTAAGC TTGGCAC 107 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DKA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CCGCCCACCT CTTTTACGAG CCTGTACGAA ATAGTGCTCT TGTTAGTAT 49 
(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
<xi) SEQUEMCB DESCRIPTION: SEQ ID NO: 25: 

TCTCTTCAGC GATGCACGCT TGTTTTAATG TTGCACCCAT GTTAGTGA 

(2) INFORMATION PGR SEQ ID NO: 26: 

(i) SEQOENCB CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SBQtJENCE DESCRIPTION: SEQ ID NO: 26: 

TCTCATCAGC GATTGAACCA CTTGGTGGAC AGACCCATGT TAGTGA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCGCCCACCT CTTTTACGAG CCTGTACOAA ATAGTQTTCT TGTTAGTAT 
(2) INFORMATION FOR SEQ ID NO: 28: 
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ix) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCGCCCACCT CTTTTACGAG CCTGTACGAA ATAGTOCTCT CGTTAGTAT 41 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TCTCAC3ACTT AGTCCATCy^C ACTCTGTGCA TATGCCTGCT TGATGTGA 48 
(2) INFORMATION FOR SEQ ID HO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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CTCTCATCTG CTAGCACGCT CGAATAGTGT CAGTCGATGT GA .42 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOI^CIXLE TYPE: DKA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

15 TACAGCGATT CACCCTTGTT TAAGGGTTAC ACCCATGTTA 40 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATCAGCGATT AACGCTTGTT TCAATQTTAC ACCCATGTTA 40 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



wo 96/17086 PCTAJS9S/15580 

-70- 

(ii) MOLECULE TYPE: DNA Cgenomic) 

(xi) SEQUEflCB DESCRIPTION: SEQ ID NO: 33: 

5 TTCAGCOATT AACGCTTATT TTAGCGTTAC ACCCATGTTA 40 

(2) INPORmTION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID KO:34: 
ATCAGCGATT CACCCTTGTT TIAAG6TTGC ACCCATGTTA 40 
(2) INFORMATION FOR SEQ ID NO:35: 



20 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ATCAGCGATT CACCCTTGTT TAAGCGTTAC ACCCATGTTG 40 
35 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DZTA {genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

10 ATCAGCGATT CACCCTTGTT TTAAGGTTAC ACCCATGTTA 40 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ATCAGCGATT AACGCTTATT TTAGCGTTAC ACCCATGTTA *° 
(2) INFORMATION FOR SEQ ID NO:38: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
3Q (B) TYPE: nucleic acid 

(Cl STRANDEDNESS: single 
(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: I^A (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
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ATCAGCGATT AACGCTTGTT TTAGTGTTQC ACCCATGTTA 40 
(2) IKFORMATION FOR SEQ ID NO: 39: 

<i) SBQUENCB CHA2UICTBRZSTICS : 

(A) LENGTH; 40 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 39: 
ATCAGCGATT AACGCTTATT TTAGCATTAC ACCCATGTTA 40 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIFTIC^: SEQ ID NO: 40: 
GCCATGCTTT 10 
(2) INFORMATION FOR SEQ ID NO: 41: 

<i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
CTCTATTTCT 

(2) INFORMATION FOR SEQ ID NO; 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 42: 
TATGTGACGC TA 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLE(OT*E TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
TATAGTCGTA 

(2) INFORMATION FOR SEQ ID NO: 44; 



<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

S 

(ii) MOLECDLE TYTE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
10 ATAGCGTATT A 11 

(2) IHFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

15 (A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



25 



35 



ATAGTTACGT CAT 13 



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
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AJITA6TGMG TGTT 

(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
<xi) SEQUENCE DESCRIPTION: SEQ ID N0:47: 
ATA6GCCC66 T 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
AATAGT6AGG CTT6 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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30 



-76" 

(ii) MOLECULE TYPE: RNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: 
GUAACOAGAG AU 12 
(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 9B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI*SENSE: NO 
20 (ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 7. .18 

(D) OTHER INFORMATION: /note- "Position 7-18 is RNA; the 

remainder o£ the sequence is DMA." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

6GAAAAGUAA CUAGAGADGG AAGAGATGGC GACNNNNNNN NNNNNNNNNN NNNNNNNNNN 60 
NNNNNNNNNN NNNNNNNNNN NHNCGGTAA6 CTTGGCAC 98 

(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(il) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(Iv) ANTI-SENSE: NO 
(ix) FEATURE: 
5 (A) NAME/KEY: raisc_feature 

(B> LOCATION: 1..24 

(D) OTHER INFORMATION! /note= "Positions X-24 is RNA; the 
remainder of the sequence is DNA." 

10 Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GQAAAAAGUA ACOAGAGAUG GAAGAGATGG CGACNNNNNN NNNNNNNNNN NNNNNNNNNN 60 
NNNNNNNNNN NNNNNNNNNN NNNNCGQTAA GCTTGGCAC 

15 (2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52; 

CCAATAGTGC TACTGTGTAT CTCAATOCTG GAAACACGGG TTATCTCCCG 50 

(2) INFORMATION FOR SEQ ID NO: 53: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) XiENGTH: SO base pairs 
35 (B) TYPES nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) 



MOLECOLB TYPE: Z>NA (genomic) 



(iii) 



HYPOTHETICAL: NO 



<iv) 



Airri*SENSE: KO 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CCAAAACAGT GGAGCATTAT ATCTACTCCA CAAAGACCAC TTTTCTCCCG 



50 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SBN5E: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
ATCCGTACTA GCATGCAGAC AGTCTGTCTG COTTTTCATT ACTCACTCCC 50 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CAATTCATGA TGACCAACTC TGTCAACACG CGAACTTTTA ACACTGGCA 49 
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(2) INFORMATION FOR SEQ ID NO: 56: 

(1) SBQUENCE CHARACTERISTICS: 

(A) 1£NGTH: 50 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLBCULB TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56; 

15 CTTCCACCTT CCGAGCCGGA CGAAGTTACT TTTTATCACA CTACGTATTG 50 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
GGCAAGAGAT GGCATATATT CAGGTAACTG TGGAGATACC CTGTCTGCCA SO 
(2) INFORMATION FOR SEQ ID NO: 58: 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) T0P0LCX3Y: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 Ciii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

10 CTAGACCATT CACGTTTACC AAGCTATGGT AAGAACTA6A ATCAC6C6TA 50 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CGTACACGT6 GAAAAGCTAT AAGTCAAGTT CTCATCATGT ACCTGACC(3C SO 
(2) INFORMATION FOR SEQ ID NO: 60: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENCSTH: 50 bas6 pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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Civ) ANTI-SBNSB: NO 

(xi) SEQUENCE DESCRIPTION: SBQ ID KO:60: 

5 CAGTGATACA TOAGTOCACC 0CTACX3ACTA AGTCTGTAAC TTATTCTACC 50 

(2) INFORMATION FOR SEQ ID NO: 61: 

ii) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNSSS : single 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

20 

ACCGAATTAA ACTACCGAAT AGTGTGGTTT CTATGCTTCT TCTTCCCTGA 50 
(2) INFORMATION FOR SEQ ID NO: 62; 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID N0j62: 



CAGGTAGATA TAATGCGTCA CCGTGCTTAC ACTCGTTTTA TTAGTATGTC 



SO 
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(2) INFORMATIOK FOR SEQ ID NO: 63: 

(1) SEQUEKCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 
5 (B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

15. CCCTACAACA CCACTGGGCC CAATTAGATT AACGCTATTT TATAACTCG 4 9 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CCAAAC6GTT ATAAGACTGA AAACTCAATC AATAGCCCAA TCCTC60CC 49 
(2) INFORMATION FOR SEQ ID N0:65: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) smMJDEVmsS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DKA (genomic) 
5 (iii) HYPOTHETICAL: NO 

(iv) AMTI*SEKSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

10 CACATGTATA CCTAAGAAAT TGGTCCCGTA GACGTCACAQ ACTTACOCCA 50 

(2) INFORMATION FOR SEQ ID NO: 66: 

<i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



25 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO:66: 
CACAACGAAA ACAATCTTCC TTGGCATACT GGQGAGAAA6 TCTGTTGTCC 50 
(2) INFORMATION FOR SEQ ID NO:67: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 67: 

5 CACACGAACA TGTCCATTAA ATGGCATTCC GTTTTTCGTT CTACATATQC 50 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

20 

CAGAACGAGG GTCTTGTAAG ACTACACCTC CTCAGTGACA ATAATCCTG 49 
(2) INFORMATION FOR SEQ ID NO: 69: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

35 (xi) SEQUENCE DESCZRIPTIQN; SEQ ID NO: 69: 



CACTACAGCC TGATATATAT GAAGAACAGG CAACAAGCTT ATGCACTGG 



49 
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{2) INFORMATION FOR SEQ ID KO:70: 

Ix) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SEHSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
GGGTACATTT ATGATTCTCT TATAAAGAGA ATATCGTACT CTTTTCCCCA 
(2) INFORMATION FOR SEQ ID NO: 71: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CCAAAGTACA TTCCAACCCC TTATACGTGA AACTTCCAGT AGTTTCCTA 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAKDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
5 <iii) HYFOTHBTZCAL: NO 

(iv> ANTI -SENSE; NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

10 CTTGAAGATC CTCATAAGAC GATTAAACAA TCCACTGGAT ATAATCCGGA 50 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

25 

CGAATAGT6T CCATGATTAC ACCAATAACT GCCTGCCTAT CATGTTTATG 50 
(2) INFORMATION FOR SEQ ID NO: 74: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



wo 96/17086 



PCT/DS95/15580 



-87- 

(iv) ANTI-SENSB: NO 



ixi) SEOOENCE DESCRIPTION: SEQ ID NO: 74: 
5 CCAAOAOAGrr ATCGGATACA CTTGGAACAT AGCTAACTCG AACTGTACCA 



(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

15 {i±) NOLECntiE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCACTC3ATAA ATAGGTAACT GTCTCATATC T6CCAATCAT ATGCCGXA 48 



(2) INFORMATION FOR SEQ ID NO:76: 



25 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CCCAAATTAT AAACAATTTA ACACAAOCAA AAGGAGGTTC ATTGCTCCGC 
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(2) INPORMATIOM FOR SBQ ID N0:77; 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 50 base pairs 
5 <B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

15 CAATAAACTG GTGCTAAACC TAATACCTTG TATCCAAGTT ATCCTCCCCC 50 

(2) INFORMATION FOR SEQ ID N0:7B: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CCGAATGACA TCCCJTAGTGG AACCTTGCTT TTGACACTAA GAAGCTACAC 50 
(2) INFORMATION FOR SBQ ID N0:79: 



35 



(i) SEQUENCE OiARACTTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 
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(C) STRRNUfeDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

10 CCATAACAAA TACCATAGTA AAGATCTGCA TTATATTATA TCGGTCCACC 50 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

<D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE; DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
CAGAACAAAG ATCAGTAGCT AAACATATGG TACAAACATA CCATCTCGCA 50 
(2) INFORMATION FOR SEQ ID NO: 81: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(li) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) AMTI- SENSE: KO 

(xi> SEQUENCE DESCRIPTION: SBQ ID NO: 81: 
CCTTTAGTTA GGCTAGCTAC AACGATTTTT CCCTGCTTGG CAACGACAC 49 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CTCCCTACGT TACACCAGCG GTACGAATTT TCCAOGAGAG GTAATCOGCA 50 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

.(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



CGGCACCTCT AGTTAGACAC TCCGGAATTT TTCCCC 



36 
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(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUE3«CK CHARACTERISTICS: 
(A) LENGTH: 49 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
15 CGGCACCTCT AGTTAGACAC TCCGGAATTT TAGCCTACCA TAGTCCGGT 



(2) INFORMATION FOR SEQ ID NO:B5: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIjOGY: linear 

25 (ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
<iv) ANTI -SENSE: NO 
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(xi) SBQtJENCE DESCRIPTION: SEQ ID NO: 85: 
CCCTTTGGTT AGGCTAGCTA CAACGATTTT TCCCTGCTTG AATTGTA 47 



(2) INFORMATION FOR SEQ ID NO: 86: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS ! single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HyPOTHETXCAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CCCTTTGGTT A6GCTA6CTA CAACGATTTT TCCCTGCTTG ACCTGTTACG A 51 
<2) INFOmATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS! single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CCTTTAGTTA <3<3CTAGCTAC AACGATTTTT CCCTGCTTC3G AAC6ACAC 48 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL; NO 
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(iv) ANTI- SENSE: NO 

(xi) SBQCENCB DESCRIPTION: SEQ ID 110:68: 

S CATGGCTTAA TCATCCTCAA TAQAA0ACTA CAAGTCGAAT ATGTCCCCCC 50 

(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS 2 single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
CAACAGAGCG AGTATCACCC CCTGTCAATA GTCGTATGAA ACATTGGGCC 50 
(2) INFORMATION FOR SEQ ID NO: 90: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



TACCGACAAG GGGAATTAAA AGCTAGCTGG TTATGCAACC CTTTTCGCA 



49 
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(2) INFORMATION FOR SEQ ID NO: 91: 



(1) SEQOENCB CHARACTERISTICS: 
(A) LENGTH: 49 base pairs 
5 (B> TYPE: nitcleie acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
10 <iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
15 CTCGAAACAG T6ATATTCTG AACAAAC66G TACTAC6TGT TCAGCCCCC 49 

(2) INFORMATION FOR SEQ ID NO: 92: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



CCAATAACGT AACCCGGTTA GATAAGCACT TAGCTAAGAT GTTTATCCTG 50 



(2) INFORMATION FOR SEQ ID NO: 93: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
(6) TYPE: nucleic acid 
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(C) STRANDCDNESSi single 

(D) TOPOLOGY: linear 

(ii) MOI4BCULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL! NO 
(iv) ANTI- SENSE: NO 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 93: 
CAATACAATC GGTACGAATC CAGAAACATA ACGTTGTTTC AGAATGGTCC 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: MO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GCAACAACAA GAACCAAGTT ACATACACGT TCATCTATAC TGAACCCCCA 
<2) INTORMATION FOR SEQ ID NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

5 CCTTTGAGTT CCTAAATGCC GCACGGTAAG CTTGQCACAC TTTGACTGTA 50 

(2) INFORMATION FOR SSQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

i 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(lii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



CAAAGATCTC ACTTTGQAAA T6CGAAATAT GTATATTCGC CCTGTCTGC 49 



(2) INFORMATION FOR SEQ ID NO:97: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
<iv) ANTI-SENSE: NO 

35 (xi) SEQUENCE DESC31IPTI0N : SEQ ID NO: 97: 

CCACGTAGAA TTATCTGATT TATAACATAA CGCAGGATAA CTCTCGCCCA 50 
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(2) INFORMATION FOR SEQ ID NO: 98: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base paira 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY s linear 



(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
15 CACAAGAAAG TGTCGTCTCC AGATATTTGA GTACAAGGAA CTACGCCC 



(2) INFORMATION FOR SEQ ID NO: 93: 



ix) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
CATGAAGAAA TAGGACATTC TACAGGCTGG ACCGTTACTA TGCCTGTAGQ SO 



(2) INFORMATION FOR SEQ ID NO:100: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLEOTLB TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CATAGGATAA TCATGGCGAT GCTTATGACG TQTACATCTA TACCTT 46 
(2} INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomiq) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 101: 



CAGATGATCT TCCTTTAAAG ACTACCCTTT AAAGAAACAT AAGGTACCCC 



50 
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We Claim: 

1 . A catalytic DNA molecule having site-specific endonuclease activity. 

2. The catalytic DMA molecule of claim 1, wherein said endonuclease 
activity is specific for a nucleotide sequence defining a cleavage site comprising single- 

5 stranded nucleic ecid in a substrata nucleic acid sequence. 

3. The catalytic DNA molecule of claim 2, wherein said single stranded 
nucleic acid comprises RNA, DNA. modified RNA. modified DNA, nucleotide analogs, or 
composites thereof. 

4. The catalytic DNA molecule of claim 2, wherein said substrate nucleic 
10 acid comprises RNA, DNA, modified RNA, modified DNA, nucleotide analogs, or 

composites thereof. 

5. The catalytic DNA molecule of claim 2, wherein said endonucteese 
activity comprises hydrolytic cleavage of a phosphoester bond at said cleavage site. 

6. The catalytic DNA molecule of claim 1, wherein said molecule Is single- 

15 stranded. 

7. The catalytic DNA molecule of claim 1, wherein said molecule includes 

one or more hairpin loop structures. 

8. The catalytic DNA molecule of claim 1. wherein said substrate nucleic 
acid sequence is attached to said catalytic DNA molecule. 

20 9. The catalytic DNA molecule of claim 1, wherein said substrate nucleic 

acid sequence is not attached to sard catalytic DNA molecule. 

10. The catalytic DNA molecule of claim 1, wherein said catalytic DNA 
molecule comprises a nucleotide sequence selected from the group consisting of: 
5EQ ID NO 3 and SEQ ID NOS 14 through 22. 
25 11. The catalytic DNA molecule of daim 1. wherein said catalytic DNA 

molecule comprises a nucleotide sequence selected from the group consisting of: 
SEQ ID NOS 23 through 30. 

12. The catalytic DNA molecule of claim 1, wherein said catalytic DNA 
molecule comprises a nucleotide sequence selected from the group consisting of: 

30 SEQ ID NOS 31 through 39. 

1 3. The catalytic DNA molecule of claim 1 , wherein said catalytic DNA 
molecule comprises a nucleotide sequence selected from the group consisting of: 

SEQ ID NOS 52 through 101. 

14. The catalytic DNA molecule of claim 11, 12, or 13, wherein said 
35 endonuclease activity is enhanced by the presence of IVlg**. 

1 5. The catalytic DNA molecule of claim 1 , wherein said catalytic DNA 
molecule has a substrate binding affinity of about 1 fM or less. 
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IC. The catalytic DNA molecule of claim 1, wherein said catalytic DNA 
molecule binds substrate with a of less than about 0.1 fjM. 

17. The catalytic DNA molecule of claim 2, wherein said nucleotide 
sequence defining said cleavage site comprises at least one nucleotide. 

18. The catalytic DNA molecule of claim 1, wherein said endonuclease 
activity is enhanced by the presence of a dhralent cation. 

19. The catalytic DNA molecule of claim 18, wherein said divalent cation is 
selected from the group consisting of Pb'*, Mg'*, Mn'*, Zn^*, and Ca'*. 

20. The catalytic DNA molecule of claim 1, wherein said endonuclease 
activity is enhanced by the presence of a monovalent cation. 

21. The catalytic DNA molecule of claim 20, wherein said monovalent cation 
is selected from the group consisting of Na'^ and K*^. 

22. The catalytic DNA molecule of claim 1, wherein said catalytic DNA 
molecule comprises a conserved core flanked by first and second substrate binding 
regions. 

23. The catalytic DNA molecule of claim 22, further comprising one or more 
spacer nucleotides between said conserved core and said substrate binding region. 

24. The catalytic DNA molecule of claim 22, wherein said conserved core 
comprises one or more conserved regions. 

25. The catalytic DNA molecule of daim 24, wherein said one or more 
conserved regions includes a nucleotide sequence selected from the group consisting of: 

CG; 

CGA: 

AGCG; 

AGCCG; 

CAGCGAT; 

CTTGTTT; and 

CTTATTT. 

26. The catalytic DNA molecule of claim 24, further comprising one or more 
variable or spacer nucleotides between said conserved regions in said conserved core. 

27. The catalytic DNA molecule of claim 22, wherein said first substrate 
binding region includes a nucleotide sequence selected from the group consisting of: 

CATCTCT; 

GCTCT; 

TTGCTTTTT; 

TGTCTTCTC; 

TTGCTGCT; 
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GCCATGCTTT; 
CTCTATTTCT 
GTCGGCA; 
CATGTCTTC; and 
5 ACTTCT, 

28. The catalytic DNA molecule of daim 22, wherein said second substrate 
binding region includes a nucleotide sequence selected from the group consisting of: 

TATGTGACGCTA; 

TATAGTCQTA; 
10 ATAGCGTATTA; 

ATAGTTACGTCAT; 

AATAGTGAAGTGTT; 

TATAGT6TA; 

ATAQTCGQT; 
1 5 ATAGQCCCGGT; 

AATAGTGAGGCTTG; and 

ATGNTG. 

29. The catalytic DNA molecule of claim 22, further comprising a third 
substrate binding region, wherein said third region includes a nucleotide sequence 

20 selected from the group consisting of: 
TGTT; 
TGTTA; and 
TGTTAG. 

30. The catalytic DNA molecule of claim 29, further comprising one or more 
25 spacer regions between said substrate binding regions. 

31 . A composition comprising two or more populations of catalytic DNA 
molecules according to daim 1, wherein each population of catalytic DNA molecules is 
capable of cleaving a different nucleotide sequence in a substrate. 

32. A composition comprising two or more populations of catalytic DNA 
30 molecules according to claim 1, wherein each population of catalytic DNA molecules is 

capable of recognizing a differertt substrate. 

33. A method of selecting a catalytic DMA molecule that cleaves a substrate 
nucleic acid sequence at a specific site, comprising the following steps: 

a. obtaining a poputation of single-stranded DNA molecules; 
35 b, admixing nucleotlde-containing substrate molecules with said population 

of single-stranded DNA molecules to form an admixture; 
c. maintaining said admixture for a sufficient period of time and under 
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predetermined reaction conditions to allow single-stranded DNA molecules in 
said population to cause cleavage of said substrate sequences, thereby 
producing substrate cleavage products; 

d. separating said population of single-stranded DNA molecules from said 
substrate sequences and substrate cleavage products; and 

e. isolating single-stranded DMA molecules that cleave nucleotide- 
containing substrate at a specific site from said population. 

34. The method of claim 33, wherein said substrate comprises RNA. 

35. The method of claim 33, wherein said DNA molecules that cleave said 
substrate at a specific site are tagged with an immobilizing agent. 

36. The method of claim 35, wherein said agent comprises biotin. 

37. The method of claim 35, wherein said Isolating step funher comprises 
exposing said tagged DNA molecules to a solid surface having avidin linked thereto, 
whereby said tagged DNA molecules become attached to said solid surface. 

38. A method of cleaving a phosphoester bond, comprising: 

a. admixing a catalytic DNA molecule capable of cleaving a substrate 
nucleic acid sequence at a defined cleavage site with a phosphoester bond- 
containing substrate, to form a reaction admixture; and 

b. maintaining said admixture under predetermined reaction conditions to 
allow said catalytic DNA molecule to cleave said phosphoester bond, thereby 
producing a population of substrate products. 

39. The method of daim 38, further comprising the steps of 

a. separating said products from said catalytic DNA molecule; and 

b. adding additional substrate to said catalytic DNA molecule to form a new 
reaction admixture. 

40. The method of claim 38, wherein said substrate comprises RNA. 

41. The method of claim 38, wherein said predetermined reaction conditions 
include the presence of a monovalent cation, a divalent cation, or both. 

42. A method of engineering catalytic DNA molecules that cleave 
phosphoester bonds, comprising the following steps: 

a. obtaining a population of single-stranded DNA molecules; 

b. introducing genetic variation into said population to produce a variant 
population; 

c. selecting individuals from said variant population that meet 
predetermined selection criteria; 

d. separating said selected Individuals from the remainder of said variant 
population; and 
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e. amplifytng said selected individuals. 

43. A non-naturally-occurring cataiytic DNA molecule comprising a 
nucleotide sequence defining a conserved core flanked by one or more recognition 
domains, variable regions* and spacer regions. 
5 44. The catalytic DNA molecule of claim 43, wherein said nucleotide 

sequence defines a first variable region contiguous or adiacent to the 5'-terminus of the 
molecule, a first recognition domain located 3'-termlnal to the first variable region, a first 
spacer region located 3'-terminai to the first recognition domain, a first conserved region 
located 3'-termlnal to the first spacer region, a second spacer region located 3'-terminal 

10 to the first conserved region, a second conserved region located 3'-termlnal to the 

second spacer region, a second recognition domain located 3'-terminal to the second 
conserved region, and a second variable region located 3' -terminal to the second 
recognition domain. 

45. The catalytic DNA molecule of claim 43, wherein said nucleotide 

15 sequence defines a first variable region contiguous or adjacent to the 5'-terminus of the 

molecule, a first recognition domain located 3*-terminal to the first variable region, a first 
spacer region located 3* -terminal to the first recognition domain, a first conserved region 
located 3*-terminal to the first spacer region, a second spacer region located 3'-termlnal 
to the first conserved region, a second conserved region located 3*-termlnal to the 

20 second spacer region, a second recognition domain located 3*-termlnal to the second 

conserved region, a second variable region located 3*-terminal to the second recognition 
domain, and a third recognition domain located 3' -terminal to the second variable region. 
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